A method for the preparation of high density peptide (or other polymer) libraries, and for screening such
libraries for molecules having the capacity to recognize targets of choice, is provided.
The present invention describes a novel method for preparation of high density peptide (or other
polymer) libraries, and for screening such libraries for peptides having the capacity to recognize targets of
choice.

Many biological phenomena are known to involve the interaction of peptides with a macromolecular 
target, such as a protein. Such peptides include the vasoactive intestinal peptide, the angiotensins, and the
endothelins. 

For such an interaction to occur, the peptide must be able to fold into a conformation in which it 
presents a surface which is complementary to a critical region, e.g., a catalytic pocket, of the target 
molecule. While short peptides that interact with proteins or other biomolecules are used in research and
clinical therapy (Magazine, 1991; Bischoff, 1992; Baumbach, 1992), the rational design of peptides to bind
to new targets, or to enhance or render more specific their binding to a known target, is difficult.

In peptides, the number of total chemical structures is determined by the number of different amino 
acids used and by the peptide length, e.g., from 20 amino acids it is possible to synthesize 20^9 different
peptides with a length of 9 amino acids. 

If one cannot rationally predict the peptides which will have the desired binding activities, it is desirable
to be able to screen, simultaneously, a large number of peptides of different sequences, which are prepared
and presented in a manner which facilitates identification of binding peptides. Such a collection of peptides
is called a "peptide library" (Birbaum, 1992; Amato, 1992).

One method for construction of a peptide library is by genetic engineering (Scott 1990, Cwirla 1990,
Devlin 1990). Peptides are expressed as part of the pIII protein on the outer surface of
a filamentous phage. Phages reacting specifically with a target of choice are selected by panning and
expanded. The relevant DNA sequence of the selected phages is then determined, and the peptide
sequences are deduced from the identified DNA sequences. The advantages of this approach to peptide
library construction are speed and convenience. The major disadvantage is that the peptide library contains
only linear (non-branched) peptides composed of the 20 native L-amino acids.

Another approach for the generation of a peptide library is chemical synthesis. The major advantage of
the chemical approach is the ability to produce peptides whose composition is not limited to the twenty
genetically encoded amino acids. The use of a large number of different amino acids increases the extent
of diversity it is possible to obtain from short peptide sequences. The chemical approach also facilitates the
synthesis of cyclic and branched peptides. Besides increasing the diversity of the library, the use of
non-native amino acids enables better control of peptide properties: e.g., lipid solubility of peptides may be
greatly increased by use of sulfoxide derivatives of methionine.

The "addressable library" approach, practiced by Affimax (Fodor 1991), is as follows: peptides are
synthesized in squares as small as 10x10 µm on a piece of glass. The peptide sequence formed in each
square is known by virtue of its position. On a surface of about 1 cm^2 it is possible to pack as many as
100x100 = 10,000 different peptides. The peptides are reacted with a fluorescent ligand and the stained
squares are identified under the microscope. In this way it is possible to immediately identify peptides
binding to a specific ligand. The major disadvantage of this approach is the relatively small number of
peptides it is possible to screen.

In the variation presented by Houghten (1991), hexapeptide mixtures were synthesized from 18 L-native
amino acids. Position 6 corresponds to the C terminal and position 1 corresponds to the N terminal. A
complete mixture of all 18 amino acids was introduced in each of positions 3-6. The peptide mixture was
then separated into 18x18 = 324 different tubes, and in each tube a specific dipeptide was introduced in
positions 1-2. The 324 peptide populations were screened for activity (e.g., inhibition of antibody-antigen
interaction) and the most active peptide mixture was identified. Next, 18 new peptide mixtures were
synthesized. In positions 1 and 2 all of the 18 new peptide mixtures contained the dipeptide identified in the
previous step. The 3rd position of each of the mixtures had a single amino acid. Positions 4-6 contained a
mixture of all 18 amino acids. The 18 peptide mixtures were screened for activity and the best reacting
mixture was selected for further characterization. The process was repeated until all 6 positions in the
peptide were identified.

More recently, Houghten (Oral presentation and abstract, European Peptide Society (EPS) 92 symposium,
Interlaken, Switzerland) suggested a different approach. Starting from 18 amino acids, a total of
18x6 = 108 peptide mixtures were synthesized. In 18 mixtures, position 6 contained a unique amino acid,
and positions 1-5 contained a mixture of all amino acids. In another 18 mixtures position 5 contained a
unique amino acid and all other positions contained a mixture of all 18 amino acids, and so on. Once
synthesized, all the 108 peptide mixtures were tested simultaneously and the most active mixture out of
each group of 18 mixtures representing a given position was identified. The desired sequence was thus
identified in a single day.
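
The bookkeeping behind the two Houghten schemes can be checked with a short calculation. The following Python sketch is illustrative only; the function names are hypothetical and it assumes every mixed position carries all 18 amino acids. It counts the mixtures synthesized and the number of distinct peptides present in each mixture.

```python
def iterative_deconvolution(n=18, length=6, first_block=2):
    """1991 scheme: define a dipeptide first, then one more position per round.

    Returns a list of (mixtures synthesized, peptides per mixture), one per round.
    """
    rounds = []
    defined = first_block
    # Round 1: every possible dipeptide in positions 1-2, the rest mixed.
    rounds.append((n ** first_block, n ** (length - defined)))
    # Later rounds: 18 mixtures each, one additional defined position.
    while defined < length:
        defined += 1
        rounds.append((n, n ** (length - defined)))
    return rounds


def positional_scanning(n=18, length=6):
    """1992 scheme: one defined position per mixture, all others mixed.

    Returns (mixtures synthesized, peptides per mixture).
    """
    return n * length, n ** (length - 1)


if __name__ == "__main__":
    for number, (mixtures, peptides) in enumerate(iterative_deconvolution(), 1):
        print(f"round {number}: {mixtures} mixtures, {peptides} peptides each")
    print("positional scanning:", positional_scanning())
```

Under these assumptions the iterative scheme needs five rounds of synthesis (324 mixtures, then four rounds of 18), while the positional scheme needs a single round of 108 mixtures, each of which is far more complex.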
The major disadvantage of both Houghten approaches is the limited ability to incorporate a large
number of different amino acids. When the number of different amino acids is increased, the relative
amount of each unique peptide within the mixture is reduced and thus its effects become more difficult to
identify. Houghten compensates by testing his peptide mixtures for activity at a concentration of 5 mg/ml.
However, testing libraries constructed of more than 40 or so different amino acids would be difficult to
conceive.
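
The dilution effect can be made concrete with a rough estimate. The sketch below is an illustration, not a measurement: it assumes an equimolar mixture in which all peptides have about the same molecular weight, so that the total concentration is shared equally, and it takes four mixed positions as in the first round of the 1991 scheme. The function name is hypothetical.

```python
def per_peptide_ng_per_ml(total_mg_per_ml, n_amino_acids, mixed_positions):
    """Approximate concentration of each individual peptide in a mixture.

    Assumes equal representation and equal molecular weight for all peptides.
    """
    n_peptides = n_amino_acids ** mixed_positions
    return total_mg_per_ml * 1e6 / n_peptides  # 1 mg = 1e6 ng


if __name__ == "__main__":
    for n in (18, 40):
        conc = per_peptide_ng_per_ml(5, n, 4)
        print(f"{n} amino acids: about {conc:.2f} ng/ml per peptide")
```

Going from 18 to 40 amino acids at the same 5 mg/ml total reduces the share of each peptide by a factor of roughly 24 under these assumptions.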

The present invention overcomes the aforementioned deficiencies of the Background Art. In particular, it 
permits the screening of much larger peptide libraries. 

Conventionally, when a peptide library is synthesized on beads, on any given bead, all of the peptide
molecules have the same sequence. Consequently, as previously explained, the diversity of the peptide
library is limited by the total number of beads employed, which in turn is limited by human factors. We
have estimated this limit to be on the order of 10^8 different beads, and hence 10^8 different peptide
sequences. By the method of the present invention, it is believed that the diversity of the library may be
increased by as much as seven orders of magnitude, i.e., to as many as 10^15 different peptide sequences.

In the present invention, each bead bears, not a single peptide sequence, but a single family of related
peptide sequences. (Normally, the family trait is a common (or low degeneracy) amino terminal portion of
one or more amino acids.) The peptide library, in turn, includes many different families of peptides, with
each family being found on one or more beads. Because the peptide library is arranged so that the peptide
complement of each bead is constrained, the library is said to be "structured." This structured library is
then subjected to a round of screening.

If a bead is marked by an affinity reagent, it indicates that one or more of the peptides in its family are 
bound by the affinity reagent. The peptide mixture on the bead is then sequenced to determine the 
common (or low degeneracy) amino terminal portion, the familial "marker." 

In the next round of screening, a sublibrary of the library of the prior round is constructed, in which all
peptides possess the familial marker of the successful family in the last library. Each bead of this new
library carries only peptides belonging to a subfamily of the aforementioned family. When this sublibrary is
screened with an affinity reagent, the beads which are bound are those whose subfamilies include a binding
peptide. The process is then repeated, with each successful family of the library of one screening round
becoming, in the next round, a new library, which in turn is divided into families. Eventually, the entire
sequence of the binding peptide is known.

While, for convenience, the description refers to synthesis, screening and sequencing of peptide
libraries, it applies, mutatis mutandis, to libraries displaying other heteropolymers whose ability to bind
specifically to a target is related to their specific sequence of monomeric units. Such polymers include
peptoids, nucleic acids, and carbohydrates. It should further be noted that the term "polymer" is intended to
include "oligomers".

The present invention involves preparation of a peptide (or other polymer) library in which a highly
diverse collection of peptides is synthesized on beads by solid phase peptide synthesis techniques and
then presented to potential targets. The library is structured so that each bead itself offers a detectable
number of molecules of essentially each of a family of different yet related peptides. Because of this family
relationship, once a "bead" is identified as "positive" by affinity screening, one or more amino acids of the
part of the sequence which is "common" to all peptide sequences borne by that bead can be identified. A
new peptide library is then prepared whose members correspond to the "positive family" of the prior
library. This daughter library is in turn structured into families, one family per bead, so that screening and
sequencing lead to the identification of additional residues of the actual binding peptide(s). The process is
continued until the binding peptides have been fully sequenced.
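
The iterative logic just described can be illustrated with a small simulation. The Python sketch below is a deliberately idealized model, not the claimed method itself: it assumes a single binding peptide, that exactly one new amino-terminal residue is defined per round, that a bead stains positive whenever its family contains the binder, and that sequencing always reads the familial marker correctly. The function name and the example sequence are hypothetical.

```python
def screen_structured_library(target, alphabet):
    """Recover `target` round by round from an idealized structured library.

    Returns (recovered sequence, number of screening rounds).
    """
    known = ""  # familial marker determined so far (amino-terminal portion)
    rounds = 0
    while len(known) < len(target):
        # One family per bead: the known marker plus one newly defined residue;
        # all remaining positions on that bead are a mixture of the alphabet.
        families = [known + residue for residue in alphabet]
        # Idealized affinity screening: a bead is positive if its family
        # includes the binding peptide, i.e. its marker is a prefix of it.
        positives = [marker for marker in families if target.startswith(marker)]
        if not positives:
            raise ValueError("no bead carries a family containing the binder")
        # Sequencing the positive bead reveals the extended familial marker,
        # which defines the daughter library for the next round.
        known = positives[0]
        rounds += 1
    return known, rounds


if __name__ == "__main__":
    amino_acids = "ACDEFGHIKLMNPQRSTVWY"
    print(screen_structured_library("WKYMVM", amino_acids))
```

In this model a hexapeptide is resolved in six rounds, each of which needs only as many bead families as there are amino acids, whereas a one-peptide-per-bead library of the same diversity would need one bead per hexapeptide.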

The number of different peptides of length k which can be synthesized from N different amino
acids is N^k. The number of beads it is possible to use per single library is practically limited by the amount
of the peptides we are willing to synthesize and by our ability to screen the library. Each ml of packed
beads contains from several hundred thousand to several million beads. Manually, we might screen a
library of about 100 ml of beads, containing about 10^7 - 10^9 beads. If, as is conventional, each bead carried
a single peptide, the number of beads in the library would be sufficient to screen for all the hexapeptides
which could be synthesized from up to 30 different amino acids (729x10^6). Screening of all the
hexapeptides which can be synthesized from a larger number of amino acids would be technically difficult or
impossible.
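
The relation between library size and bead count can be verified directly. The following Python sketch (function names are hypothetical) computes N^k and the largest number of amino acids whose complete set of k-mers still fits on a given number of beads, assuming one peptide per bead and one bead per peptide.

```python
def library_size(n_amino_acids, length):
    """Number of distinct peptides of the given length: N^k."""
    return n_amino_acids ** length


def max_amino_acids(n_beads, length):
    """Largest N such that N^k does not exceed the number of beads."""
    n = 1
    while library_size(n + 1, length) <= n_beads:
        n += 1
    return n


if __name__ == "__main__":
    print(library_size(30, 6))           # 729000000
    print(max_amino_acids(10 ** 7, 6))   # lower end of the bead estimate
    print(max_amino_acids(10 ** 9, 6))   # upper end of the bead estimate
```

With 10^9 beads the ceiling for complete hexapeptide coverage is about 30 amino acids, consistent with the figure given above; with 10^7 beads it drops to 14.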

The amount of peptide found on a single bead of about 100 µm diameter can be about 100 pmole, or
about 6x10^13 molecules. This number of peptide molecules is much larger than the number of molecules
needed for assaying of ligand binding to the beads. Using enzyme detection or fluorescence detection
methods it is possible to monitor the binding of antibodies with binding constants of about K_A = 10^9 to as
few as 1000 receptor molecules appearing on the cell membrane of mammalian cells with diameter of about
10 µm. The diameter of the beads used for synthesis of the peptide is about 100 µm and so their volume is
about (100/10)^3 = 1000 times larger than that of mammalian cells. We therefore conclude that using enzyme or
fluorescence detection methods we should be able to monitor the binding of ligands with K_A = 10^9 to beads
containing at least 1000x1000 = 10^6 target molecules. Since the beads contain about 6x10^13 peptide
molecules we conclude that we can pack each bead with many peptides. In fact, theoretically, we could
pack 6x10^13/10^6 = 6x10^7 different peptide molecules per bead and still be able to detect the binding of a
ligand to a single peptide type.
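
The estimate above is a chain of simple ratios, reproduced here as a Python sketch so that each step can be checked. The input figures are those quoted in the text; the function names and the use of Avogadro's number as 6.022x10^23 per mole are the only additions.

```python
AVOGADRO = 6.022e23  # molecules per mole


def molecules_per_bead(pmol=100.0):
    """Peptide molecules carried by one bead."""
    return pmol * 1e-12 * AVOGADRO


def min_detectable_molecules(cell_limit=1000.0, bead_um=100.0, cell_um=10.0):
    """Scale the per-cell detection limit by the bead-to-cell volume ratio."""
    volume_ratio = (bead_um / cell_um) ** 3
    return cell_limit * volume_ratio


def max_peptide_types_per_bead():
    """Distinct peptides per bead that would each remain detectable."""
    return molecules_per_bead() / min_detectable_molecules()


if __name__ == "__main__":
    print(f"{molecules_per_bead():.2e} molecules per bead")
    print(f"{min_detectable_molecules():.0e} molecules needed for detection")
    print(f"{max_peptide_types_per_bead():.1e} peptide types per bead")
```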

Once a certain bead is identified as containing an interesting peptide we need to determine the peptide
structure. This is achieved by N-terminal sequencing (Edman degradation). The procedure is routinely
carried out by an automatic machine. Following degradation the resulting PTH-amino acid is analyzed by
reverse phase chromatography. State of the art machines are capable of analyzing the sequence of
about 10 pmole of peptide. Thus, the 100 pmole of peptide found on each bead is an ample amount for
sequencing. However, as compared to the immunological staining process the sequencing step is relatively
insensitive. Thus, even though we could synthesize on each bead over 10^7 different peptides and still be
able to select beads based upon the interaction of the ligand with the peptides on the beads, using current
machinery we would not be able to retrieve the sequence information. In fact, even if we
synthesized only 20 different peptides on a single bead, the amount of amino acid obtained by each
Edman degradation step would be only 100/20 = 5 pmole, which is too low to allow precise
analysis of the sequence.
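
The sequencing constraint can be expressed as a simple threshold test. In the sketch below, the 100 pmole per bead and the roughly 10 pmole sequencer sensitivity are taken from the text; treating 10 pmole as a hard cut-off, and assuming the residues at a degenerate position are present in equal amounts, are simplifying assumptions. The function names are hypothetical.

```python
BEAD_PEPTIDE_PMOL = 100.0   # peptide per bead, from the text
SEQUENCER_MIN_PMOL = 10.0   # approximate sequencer sensitivity, from the text


def pmol_per_residue(degeneracy, bead_pmol=BEAD_PEPTIDE_PMOL):
    """Amount of each PTH-amino acid released in one Edman cycle when
    `degeneracy` different residues share that position in equal amounts."""
    return bead_pmol / degeneracy


def is_sequenceable(degeneracy, minimum=SEQUENCER_MIN_PMOL):
    """True if each residue at the position is above the assumed cut-off."""
    return pmol_per_residue(degeneracy) >= minimum


if __name__ == "__main__":
    for degeneracy in (1, 10, 20):
        print(degeneracy, pmol_per_residue(degeneracy), is_sequenceable(degeneracy))
```

A position shared by all peptides on the bead (degeneracy 1) yields the full 100 pmole per cycle, which is why a common amino-terminal portion remains readable even when the rest of the sequence is highly degenerate.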

We suggest that in accordance with the present invention it is possible to tackle the problem of
elucidating the structure of an active peptide, in a library expressing many peptides per bead, in an iterative
manner. According to our strategy, each step of the iteration is designed to allow identification of the
desired bead via interaction with one or several of the peptides expressed on the bead, and at the same time
enable at least partial determination of the identity of one or more amino acids of the peptide. Several
iterations are needed in order to allow the complete elucidation of the desired structure.

Target 

The purpose of constructing a peptide library is to identify peptides which bind to a target of interest.
The target may be any kind of substance whatsoever. It may be inorganic or organic; crystalline or
amorphous; micromolecular or macromolecular; naturally occurring or artificial. Typical targets include
proteins (including enzymes, hormones and receptors), carbohydrates, and lipids. Suitable targets include
human tumor necrosis factor (or its p55 and p75 receptors), and interleukin-6, its cell surface receptor, the
IL-6/receptor complex, and its transducer, gp130.

The novel binding molecules which may be obtained from peptide (or other polymer) libraries include the
following (the categories recited in italics are not mutually exclusive):

Molecules that inhibit cell-cell or cell-substratum recognition. Possible applications could be:
inhibition of tumor metastasis formation, and inhibition of platelet aggregation. The cell surface contains
groups of molecules that mediate binding of one cell to another or of cells to extracellular matrix
components. These molecules are, e.g., the integrins. See Ferguson, T.A., Mizutani, H. and Kupper, T.S.,
"Two integrin-binding peptides abrogate T cell-mediated immune responses in vivo," Proc. Natl. Acad.
Sci. USA 88:8072-8076 (1991); Skubitz, A.P.N., Letourneau, P.D., Wayner, E. and Furcht, L.T., "Synthetic
peptides from the carboxy-terminal globular domain of the A chain of laminin: Their ability to promote cell
adhesion and neurite outgrowth, and interact with heparin and the β1 integrin subunit," J. Cell Biol.
115:1137-1148 (1991); Hynes, R.O., "Integrins: Versatility, modulation, and signaling in cell adhesion," Cell
69:11-25 (1992).

Peptides capable of inhibiting the cellular activities mediated by the integrins could be discovered as
follows: An integrin would be cloned. The recombinant protein would be produced and purified. The purified
proteins would be labeled with biotin and used to screen the library. Avidin conjugated with alkaline
phosphatase would be used for staining of the beads which bind the biotinylated proteins. Following
identification of the peptides that bind to the proteins, the peptides would be synthesized in soluble form
and tested for their ability to modify the desired biological activity, e.g., as described in the cited papers.

Inhibitors of viral adhesion to cell surface receptors, e.g., molecules that will mimic the activity of
the soluble CD4. The membrane bound form of the CD4 serves as a receptor for HIV1. Discovery of
peptides capable of inhibiting the binding of viruses to cells could be done by one of two approaches. In one
approach we could use complete virions. The binding of the virions to the beads could be monitored by
using antibodies specific to the virus. Once the structure of the peptide binding to the virus is known it could
be synthesized in a soluble form and tested directly in viral inhibition assays. In a second approach, viral
proteins known to mediate the binding of the virus to the cell could be cloned, expressed and purified and
used as described above for section 1.

Inhibitors of viral-specific enzyme activities.
-- Inhibition of viral protease activity.
-- Binding to and inhibition of the viral reverse transcriptase activity.

Bactericidal or bacteriostatic molecules:
-- Molecules that would produce a hole in bacterial membranes.
-- Molecules that would block the bacterial protein synthesis machinery by interaction with the bacterial
ribosomes. The screening approach would be to test the binding of purified bacterial ribosomes (or
polyribosomes) or any other protein which participates in the synthesis of polypeptides (e.g.
enzymes) to the beads and later test the activity of the synthesized soluble peptides in inhibition of the
ribosome activity.
-- Molecules that would interfere with bacterial cell wall construction. In this case we would screen for
beads containing peptides that bind to at least one of the enzymes participating in cell wall synthesis.
Later the identified peptides would be synthesized in soluble form and tested for their ability to inhibit the
enzyme activity and consequently the cell wall synthesis.
-- Molecules that would inhibit the adsorption of the bacteria to specific targets. In this case we shall look
for beads containing peptides that bind whole bacterial cells or, more preferably, cloned and purified
bacterial proteins known to participate in recognition of bacterial targets. In a second step the identified
peptides would be tested for their ability to inhibit bacterial binding to the target.
-- Molecules that would interfere with bacterial DNA synthesis. We shall look for peptides which bind to
cloned and purified enzymes which participate in DNA synthesis or in production of DNA precursors.

Inhibitors of bacterial exo- and endotoxins. The approach would include finding peptides that bind
to the toxin and later testing the ability of the identified peptide to inhibit the toxic activity.

Molecules that have enzymatic activity. We may screen for such peptides with a colorimetric (color
generating) assay for the enzyme where the final product of the reaction would be insoluble, thereby
staining the beads, e.g., reduction of NAD to NADH could be monitored by binding on the beads of an enzyme
capable of using the reduced NADH to reduce a tetrazolium salt to insoluble colored formazan. Detection of
other types of reactions may necessitate coupling of several enzymatic reactions until the desired colored
product is obtained. See Tawfik, D.S., Green, B.S., Chap, R., Sela, M. and Eshhar, Z., "catELISA: A facile
general route to catalytic antibodies," Proc. Natl. Acad. Sci. USA 90:373-377 (1993).

Inhibitors of enzymatic activity, e.g., of proteolytic enzymes. We shall first look for peptides capable
of binding to the enzyme of choice. The peptides would then be synthesized in a soluble form and their
ability to modify the enzymatic activity would be determined.

Molecules that would modify enzyme activity in an allosteric manner.

Molecules that would bind DNA at specific sequences or sites and inhibit transcription. For
screening, beads could be stained by specific DNA segments labeled with biotin or an enzyme. In a second
stage, soluble peptides could be tested for their ability to modify transcription. Binding at a specific site
may indicate binding at loops, hairpins or other structures.

Molecules that interfere with the interaction of proteins with nucleic acids by interaction with 
the proteins. Screening for the ability of the peptide to interfere with the interaction of the protein with DNA 
or RNA could be done in the second stage, once peptides capable of binding to the protein are identified.

Molecules that serve as adjuvants in vaccines. Such structures may be used alone or as parts of 
constructs that express both the antigen and adjuvant on a single molecule.

Molecules that serve as vaccines, i.e., molecules that mimic antigenic epitopes of natural
antigens. The current approach for preparation of peptide vaccines is to prepare peptides that contain B
and T cell determinants of the antigenic protein. The preferred T cell determinant(s) are promiscuous
(reactive with many MHC isotypes), so as to allow generation of an immune response in as large a proportion
as possible of the population. The difficulty is that the peptides representing the major antigenic determinants
are not necessarily immunogenic when removed from the protein. We would use antibodies generated
against the antigen as targets in order to identify peptides capable of binding with the antibodies. These
selected peptides would mimic the immunogenic structure of the antigenic protein. Thus, the approach
would allow preparation of vaccines from parts of the protein which are immunogenic when the protein is
intact, but not in isolation. In a second step we would couple the identified peptides to a T cell determinant
and test their ability to serve as immunogens.

Molecules that interact with the T cell receptor and serve for induction of T cell suppression.
Peptides that bind to the T cell receptor could bind at the active site (recognition site) or outside of it, on the
other part(s) of the receptor polypeptide chains. Initial screening could be for peptides that bind to purified
receptor. A second screening could verify whether the bound peptide has the ability to activate a specific T cell
clone (antigen mimetic), to activate many types of T cells (superantigen mimetic), or to inhibit the activation
of the T cells by known T cell determinants. See Drake, D.G. and Kotzin, B.L., "Superantigens: Biology,
immunology, and potential role in disease," Journal of Clinical Immunology 12:149-162 (1992); Esser, U. and
Parham, P., "Superantigens: Playing upon both sides," Nature 359:19-20 (1992).

Molecules that interfere in the antigenic presentation of other molecules by interacting with
surface receptors on the antigen presenting cells (APCs). It is presently believed that protein antigens
undergo cleavage into small peptides in the APC. These processed peptides bind to MHC (major
histocompatibility antigen) molecules at the cell surface and are thus presented to T cells. Some types of
MHC molecules bind the peptide during their own synthesis and migrate with the bound peptides to the cell
surface. In order to compete with this mechanism we would need an inhibitory peptide which penetrates
into the cells to the specific site where "loading" of the MHC molecule with peptides occurs.

Other types of MHC molecules bind peptide extracellularly. The binding of these peptides could be
competitively inhibited with peptides in the circulation. The current dogma states that the binding of the
peptide to the MHC is via two or more anchoring residues (sites) of the peptide (e.g., the second residue
from the amino terminal and the free carboxy terminal) to specific sites on the MHC molecule. Theoretically,
we could design stronger binding peptides that would prevent binding of the natural peptides. The
application would be mainly in controlling autoimmune disorders.

Molecules that enhance the immunogenicity of other molecules by targeting them to antigen
presenting cells. Many peptide epitopes are not good immunogens since they cannot bind to MHC
molecules and are thus not "presentable". It is possible that if such peptides were coupled to peptide
mimetics that bind the MHC molecules, the target peptides would become immunogenic. The library
approach can be used for discovering peptides that bind to the different MHC isotypes. As stated above,
MHC molecules bind peptides via "anchoring" residues/sites. We could discover peptide mimetics that bind
to the MHC via non-conventional structures/sites that would be more efficient as compared with pure
peptides.

Molecules that inhibit the IgE mediated immediate type hypersensitivity response by preventing
the occupation of the Fc receptor by specific IgE antibodies. During the screening we would look for
peptides capable of binding to the Fcε receptor. This receptor is responsible for binding of the IgE. When
the IgE which is bound by the receptor encounters the antigen, the cell bearing the receptor is activated.
Activation of such cells is the primary element in the immediate type hypersensitivity, also known as the allergic
response. If we could prevent the cells from binding the IgE we could abrogate the activation of the cells by
the antigen, thereby preventing the allergic response. In a first step we could use the library to look for
peptides binding to the Fcε receptor. In a second step we would test the ability of the identified peptides to
block the binding of the IgE to the receptor. The main application of such peptides would be the prevention
of allergic reactions.

Molecules that inhibit the binding of complement components to immune complexes thereby
inhibiting complement activation. We can screen the library for peptides that bind to immunoglobulins
which participate in formation of immune complexes. The Fc of these immunoglobulins is conformationally
modified as compared to the Fc of free immunoglobulin. Only the Fc of immunoglobulins participating in
immune complex formation is capable of activating the complement. We could screen the library for
peptides that bind to the Fc of immunoglobulins in an immune complex. In a second step we could test the
identified peptides in a soluble form for their ability to inhibit complement activation.

Molecules that bind to soluble immune complexes and prevent their accumulation in the kidneys.
Immune complexes tend to accumulate in the basal membrane of the kidney. Non-specific activation of
complement at the site may eventually cause kidney failure. Some of the peptides that bind to the immune
complexes (via the Fc of the participating immunoglobulins) may inhibit the specific binding of the immune
complexes to the basal membranes. Alternatively, peptides may be found that prevent the activation of the
complement by the immune complex by preventing the binding of the complement component C3 to the
immune complex.

Molecules that serve as target antigens or pseudo-antibodies in immunoassays. Peptides may
replace antigens or antibodies. Replacing the antigenic reagents used in immunoassays for the presence of
cognate antibodies in the serum (e.g., measurement of antibodies to the AIDS virus) with a peptide might
improve specificity. Currently, many such assays tend to pick up false positives that have to be further
evaluated to make sure they are truly positive. Some people have tried to use peptides derived from the
virus. This approach resulted in more specific assays. Using peptide mimetics and the library approach it
would be possible to further narrow down the specificity.
The difficulties associated with use of antibodies in immunoassays are numerous: (1) Instability of the
antibody protein. (2) Changes in the specificity of the antibodies from batch to batch and upon storage. (3)
Difficulties of labeling with tracer. (4) High price. (5) Difficulties in disinfection, e.g., when preparing probes.
(6) Denaturation upon repeated elution of the analyte. (7) Difficulties in immobilization onto a solid support.
(8) Bivalency of the antibodies is sometimes disadvantageous.

Replacing the antibodies with peptides might overcome some of the above difficulties. (1) As
compared with the antibody, the peptide has no "conformation" which may be destroyed (leading to an
inactive protein). (2) The batch to batch consistency of the peptide is much easier to control as compared
with antibodies. (3) The tracer may be synthesized together with the binding part of the peptide. (4) The
price of synthetic peptides is typically lower than that of antibodies. (5) The peptides are stable to high
concentrations of organic solvents which may be used in disinfection. These solvents would destroy antibodies.
(6) Having no "fixed" conformation, the peptides would not be destroyed by extended use. (7) The peptide
does not need to be immobilized. It can be synthesized in situ on many types of supports. (8) The peptide
may be mono-, di- or poly-valent as required.

Inhibition of immune cell migration by molecules that bind to and block the cell surface receptors
responsible for chemotaxis. Some of the molecules that are expressed on the cell surface control the
ability of the cells to respond to stimuli. These cell surface receptors control, among other phenomena,
chemotaxis. These molecules need to be identified, cloned and purified. Once purified the molecules could
be used for selection of peptides which would bind to them. Some of the peptides might have the ability to
specifically block the receptor and thus abrogate the ability of the cells to respond to specific stimuli. Such
peptides could be used, e.g., in prevention of inflammation or prevention of graft rejection.

Inhibition of cytotoxic T lymphocyte (CTL) function by molecules that interfere with binding of
the CTL with the target cell. The first step in the lysis of target cells by CTL is binding of the CTL
receptor to a specific receptor on the target cells. Cloned and purified polypeptide chains of the CTL
receptor could be used for screening of a peptide library for peptides that bind to the receptor. It is
expected that some of the peptides selected would be capable of inhibiting the ability of the CTL to bind to
their targets. Peptides that bind at the antigen binding site would inhibit the activity of specific CTL clones.
However, peptides may be found that would bind outside the active site but still inhibit the ability of the
receptor to bind to its target. Such peptides could be used, e.g., for inhibition of autoimmune responses.

Molecules that temporarily inhibit the multiplication of stem cells. These can be used in order to
minimize damage to the stem cells during chemotherapy.

Molecules that interact with both tumor cells and cytotoxic T lymphocytes (CTLs) thereby
targeting the CTLs into tumors and enhancing tumor cell killing. The first step in the attack of CTL on
tumor cells is binding of the CTL to tumor cells. The binding is mediated by specific cell surface receptors
found on the CTL. The CTL receptors bind to tumor antigens expressed on the surface of tumor cells.
Recent results indicated that it is possible to induce specific lytic activity also when the CTLs bind to the
tumor cell via a bridging molecule, e.g., a bifunctional antibody that recognizes the CTL receptor and the
tumor antigen. A similar activity could be mediated by peptides having two sites: one site that would bind to
a tumor cell surface antigen and another that would bind to the CTL receptor. Screening of a peptide library
for peptides capable of binding each of the structures could be performed as described. Once such
peptides are found, they would be synthesized into a single polypeptide chain, or otherwise chemically
conjugated, and tested for their ability to induce specific lysis of the target tumor cells.

Molecules that interact with and inhibit the activity of angiogenic factors or their receptors.
These can be used for inhibition of angiogenic activity of tumors. The approach here would be similar to
discovery of peptides which bind to and inhibit other cytokines.

Molecules that inhibit multiplication of tumor cells. Peptides may be discovered that inhibit the
activity of enzymes that participate in DNA synthesis or the synthesis of any of the precursors. The
screening for such peptides would be based upon screening for peptides that bind to the desired enzyme
and inhibit its activity. The specificity to tumor cells may be mediated by another peptide that would bind to
a cell surface molecule that is internalized following binding of the peptides, thereby allowing the enzyme
inhibitor peptide access specifically into tumor cells. Such an approach is currently under investigation using
natural toxic peptides conjugated to antibody fragments; see, e.g., Better, M., Bernhard, S.L., Lei, S.P.,
Fishwild, D.M., Lane, J.A., Carroll, S.F., and Horwitz, A.H., "Potent anti-CD5 Ricin A Chain
Immunoconjugates from Bacterially Produced Fab' and F(ab')2," Proc. Natl. Acad. Sci. (USA), 90:457-461 (1993).

Molecules that enable passage through the cell membrane or the membrane of endosomal
vesicles.

- Enzyme inhibitors.

- Antisense polynucleotide or PNAs.

Molecules that enable passage through the blood-brain barrier.

Molecules that target functional groups into tumors by way of binding to cellular components.
The functional groups may include:

- Toxins

- Radionuclides: for imaging and therapy.

- Neutron capturing agents

- Enzymes: e.g., glucose oxidase.

Molecules that enable enhanced passage through skin, upper respiratory tract or lungs.

Sorting of cells following their fluorescent surface labeling or magnetic sorting, e.g., purging of
tumor cells from bone marrow cells in vitro. Like antibodies, peptides may be used as specific labels for
cells. The use of labeled peptides would allow analysis and sorting of the desired cells.

Inhibitors of embryo implantation or development.

Molecules that would serve for affinity purification of other molecules. One popular method for
purification of recombinant (and native) proteins is affinity chromatography. The affinity ligands usually are
mimetic dyes and sometimes also monoclonal antibodies. Peptides could be selected that specifically bind
to a target of choice, thus allowing its purification by affinity chromatography. They might allow a
combination of the specificity of antibodies coupled with the ease of use of mimetic dye columns, e.g.,
depyrogenization with 1 N NaOH.

Enzymatic catalysis. In some processes (e.g., immunoassay) it is necessary to immobilize proteins or
other molecules on surfaces. Peptides that bind the desired ligand could be used for such immobilization.
Immobilization of cells (adherence of cells to peptide-containing surfaces) is a specific example that has
already been demonstrated. See Fernandez, M.C., Mullenix, M.S., Christner, R.B. and Mortensen, R.F., "A
Cell Attachment Peptide From Human C-Reactive Protein," J. Cell Biochem. 50:83-92 (1992);
Chen, Y.-C.J., Danon, T., Sastry, L., Mubaraki, M., Janda, K.D. and Lerner, R.A., "Catalytic Antibodies from
Combinatorial Libraries," J. Amer. Chem. Soc. 115:357-358 (1993); and Lesley, S.A., Patten, P.A. and
Schultz, P.G., "A Genetic Approach to the Generation of Antibodies with Enhanced Catalytic Activities,"
Proc. Natl. Acad. Sci. (USA), 90:1160-1165 (1993).

Molecules that would serve as mammalian tissue culture additives. Recently it has been found that
some proteins can be used to replace serum in propagation of mammalian cells in tissue culture. Among
the proteins are insulin and some growth factors. When the receptors of these proteins are known, peptides
could be selected from the library that would bind to the receptor, e.g., the insulin receptor. The ability of
the peptide to activate the receptor and thus replace the proteins could be tested at a second stage.
The advantages of using peptides would be both economic (peptides are cheaper than proteins) and
regulatory (it is basically safer to use peptides as compared to proteins from any source, natural or
recombinant).

Molecules which are antagonists (partial or complete) of ligands (hormones, cytokines,
neurotransmitters, toxins, etc.) (Note: The term "partial antagonist" means that only some of the
biological activities of the ligand would be inhibited while other activities would not be impaired.) Activity is
mediated by interaction with any of the following:

- The ligand (hormone, cytokine, neurotransmitter, steroid, leukotrienes, releasing factor, etc.):
Prevention of the ligand interaction with the receptor.

- The receptor: Binding to the receptor and thereby prevention of the interaction of receptor with the
ligand or with signal transduction molecules.

- The signal transducing molecule(s): Prevention of the activation of the molecule by the receptor or
prevention of the signal transduction by prevention of the activation of the signal transducer.

With regard to agonists (complete or partial) or antagonists (complete or partial) of natural peptides, the
following peptides may be targets:

Angiotensin (types I and II); Biberotoxin; Endothelin; Sarafotoxin; Bombesin; Calcitonin; Calpain;
Cholecystokinins; Cecropin; Corticotropin releasing hormone (CRH); Defensin; Galanin; Gelsolin;
Glucagon; GnRH - Gonadotropin releasing hormone; Leupeptin; MSH - alpha melanotropin; NPY -
Neuropeptide Y; Peptide Leukotrienes; Somatostatin; Substance P; Tachykinin; Vasopressin; VIP; Opiate
family.

Natural peptides are not convenient for use as drugs. They are relatively unstable, are difficult to
deliver, tend to have short half lives in the circulation and sometimes lack desired specificity. Peptide
mimetics could be selected that bind to the receptors of the natural peptides and mimic or antagonize their
biological activities. Since the library would probably yield several possible lead structures, it is possible
that some of them would be more suitable for use as drugs as compared with the original native peptide.

Affinity Reagent

Peptides which bind to a target of interest are identified by their binding to an affinity reagent. An
affinity reagent is a chemical entity whose binding characteristics are similar to those of the target of
interest, and whose binding to a peptide (or to another reagent which is bound to the peptide) causes a
detectable physical or chemical change to occur. Typically, the affinity reagent is a target molecule (or an
analogue thereof), conjugated with a label, such as a radioisotope, a fluorophore, a colorophore, an enzyme,
an enzyme substrate, or an electron-dense moiety. The label may be observable directly, or it may be
detectable only by virtue of further processing. For example, a biotinylated target molecule may be bound
to the peptide, then an enzyme-labeled avidin bound to the biotin tag, and finally the enzyme provided with
a substrate, the enzymatic reaction product having a distinctive color. The preferred reagents have
fluorescent or enzymatic labels. If the label is fluorescent, it is desirably rhodamine. If the label is
enzymatic, alkaline phosphatase is preferred.

Amino Acids and Peptides

Amino acids are the basic building blocks with which peptides and proteins are constructed. Amino
acids possess both an amino group (-NH2) and a carboxylic acid group (-COOH). Many amino acids, but
not all, have the structure NH2-CHR-COOH, where R is hydrogen or any of a variety of functional groups.
Twenty amino acids are genetically encoded: Alanine, Arginine, Asparagine, Aspartic Acid, Cysteine,
Glutamic Acid, Glutamine, Glycine, Histidine, Isoleucine, Leucine, Lysine, Methionine, Phenylalanine,
Proline, Serine, Threonine, Tryptophan, Tyrosine, and Valine. Of these, all save Glycine are optically isomeric;
however, only the L-form is found in humans. Nevertheless, the D-forms of these amino acids do have
biological significance; D-Phe, for example, is a known analgesic.

Many other amino acids are also known, including: 2-Aminoadipic acid; 3-Aminoadipic acid;
beta-Aminopropionic acid; 2-Aminobutyric acid; 4-Aminobutyric acid (Piperidinic acid); 6-Aminocaproic acid;
2-Aminoheptanoic acid; 2-Aminoisobutyric acid; 3-Aminoisobutyric acid; 2-Aminopimelic acid;
2,4-Diaminobutyric acid; Desmosine; 2,2'-Diaminopimelic acid; 2,3-Diaminopropionic acid; N-Ethylglycine;
N-Ethylasparagine; Hydroxylysine; allo-Hydroxylysine; 3-Hydroxyproline; 4-Hydroxyproline; Isodesmosine;
allo-Isoleucine; N-Methylglycine (Sarcosine); N-Methylisoleucine; N-Methylvaline; Norvaline; Norleucine; and
Ornithine.

It has been found to be convenient to assign each of the amino acids used in the libraries 
disclosed herein an ID number for greater simplicity of reference. These ID numbers appear in Table 
101. 

Peptides are constructed by condensation of amino acids and/or smaller peptides. The amino group of
one amino acid (or peptide) reacts with the carboxylic acid group of a second amino acid (or peptide) to
form a peptide (-NHCO-) bond, releasing one molecule of water. Therefore, when an amino acid is
incorporated into a peptide, it should, technically speaking, be referred to as an amino acid residue.

Peptide Synthesis: An Overview

In a standard "Merrifield" synthesis, a side chain-protected amino acid is coupled by its carboxy
terminal to a support material (such as a resin). A side chain and amino terminal protected amino acid
reagent is added, and its carboxy terminal reacts with the exposed amino terminal of the insolubilized
amino acid to form a peptide bond. The amino terminal of the resulting peptide is then deprotected, and a
new amino acid reagent is added. The cycle is repeated until the desired peptide has been synthesized.
For an overview of techniques, see Geisaw, Trends. Biotechnol., 9:294-95 (1991).

In the conventional application of this procedure, the amino acid reagent is made as pure as possible.
However, if a mixture of peptides is desired, the amino acid reagent employed in one or more of the cycles
may be a mixture of amino acids, and this mixture may be the same or different from cycle to cycle. Thus,
if Ala were coupled to the resin, and a mixture of Glu, Cys, His and Phe were added, the dipeptides
Ala-Glu, Ala-Cys, Ala-His and Ala-Phe would be formed.

When, during a synthetic cycle, a pure amino acid is added, the resulting residue in the peptide
molecules being synthesized is called a constant residue. If a mixture of amino acids is employed, the
added residue is called a variable residue. The component amino acids of the mixture, which are the only
amino acids which can occupy that variable residue position, are called the "set" of that variable residue.
The set for one variable residue may be different from that of the next one. When any of the residues
added during the synthesis of a peptide is a variable residue, so that the synthesis deliberately produces a
mixture of peptides, the mixture is termed a peptide library. The differences among the peptide molecules
of the library will lie essentially at, and only at, the predetermined variable residue positions.
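
As an illustration only, the relationship between constant residues, variable residues and their sets can be sketched in Python. The function name `enumerate_library` and the list-of-lists representation are assumptions made for this sketch, and residues are written in the same order as in the Ala-Glu example above; nothing here is part of the disclosed synthesis procedure.

```python
from itertools import product

def enumerate_library(positions):
    """Enumerate every sequence permitted by the given residue positions.

    positions: one list per residue position. A constant residue is a
    one-element list; a variable residue is the list of amino acids in its set.
    """
    return ["-".join(combo) for combo in product(*positions)]

# Constant Ala followed by one variable residue whose set is {Glu, Cys, His, Phe}.
dipeptides = enumerate_library([["Ala"], ["Glu", "Cys", "His", "Phe"]])
print(dipeptides)
```

The number of sequences returned is the product of the set sizes, which is why each additional variable residue multiplies the number of distinct peptides.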

Peptide Library

A peptide library may consist essentially only of peptides of the same length, or it may include peptides
of different length. The peptides of the library may include, at any variable residue position, any desired
amino acid. Possible sets include, but are not limited to: (a) all of the genetically encoded amino acids; (b)
all of the genetically encoded amino acids except cysteine (because of its ability to form disulfide
crosslinks); (c) all of the genetically encoded amino acids, as well as their D-forms; (d) all naturally
occurring amino acids (including, e.g., hydroxyproline); (e) all hydrophilic amino acids; (f) all hydrophobic
amino acids; (g) all charged amino acids; (h) all uncharged amino acids; etc. The peptide library may
include branched and/or cyclic peptides.

The "size" of the library is the estimated number of peptide molecules in it. Preferably the size of the
library is at least 10^14, more preferably at least 10^16, still more preferably at least 10^18, most preferably at
least 10^20 molecules. If there are 6 x 10^13 peptides per bead, even 10^6 beads would provide 6 x 10^19
peptides, while 10^8 beads would carry 6 x 10^21 peptides.

The "diversity" ("degeneracy") of the library is the expected number of unique peptide sequences in the
library. The method of the present invention does not have a technically imposed lower limit on library
diversity. However, there would be no point to constructing a library with a diversity of only two. Desirably,
the library should be sufficiently diverse so that it would be advantageous to simultaneously synthesize and
screen the library, rather than prepare and test each peptide individually. For this reason, the library will
ordinarily have a diversity of at least 10^3. A further consideration is whether the library has a diversity
comparable to, or greater than, that of the libraries described in the Background Art. Preferably, the
diversity of the library is at least 10^8, more preferably at least 10^10, still more preferably at least 10^12, and
most preferably at least 10^14 unique sequences. A diversity of 10^14 would be achieved with 10^8 beads each
bearing 10^6 sequences. The "sequence set" of the library is the set of sequences which, given the choice
of constant and variable residues, and the sets for each variable residue, could theoretically be presented in
the library.

The "average sampling level" of the library is the size divided by the diversity, i.e., the average number
of molecules having the same peptide sequence. Preferably, the average sampling level is sufficient for
detection and at least partial sequencing. It is preferably at least 10^6 molecules per sequence, more
preferably at least 10^7, still more preferably at least 10^8. The average sampling level should be at least
equal to the peptide-per-bead detection limit (assumed to be presently 10^6), more preferably 10 times said
limit to provide a margin of safety.
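
The three quantities defined above (size, diversity and average sampling level) are related by simple arithmetic. The following Python sketch works through one case using figures quoted in the text (10^8 beads, 6 x 10^13 peptides per bead, 10^6 sequences per bead); the function names are hypothetical and chosen only for this illustration.

```python
def library_size(beads, molecules_per_bead):
    """Size: estimated number of peptide molecules in the library."""
    return beads * molecules_per_bead

def average_sampling_level(size, diversity):
    """Average number of molecules sharing the same peptide sequence."""
    return size / diversity

size = library_size(10**8, 6 * 10**13)          # 6 x 10^21 molecules
diversity = 10**8 * 10**6                       # 10^14 unique sequences
level = average_sampling_level(size, diversity)

DETECTION_LIMIT = 10**6                         # assumed per-bead detection limit
print(size, diversity, level, level >= 10 * DETECTION_LIMIT)
```

Under these assumptions the sampling level is 6 x 10^7 molecules per sequence, which exceeds ten times the assumed detection limit.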

While the peptide library may include di-, tri-, and tetrapeptides, the preferred minimum length of the 
peptides is five amino acids. There is no definite maximum length. 

Structured Peptide Library 


A structured peptide library is one in which peptide synthesis on a collection of beads (or equivalents) 
is controlled so that the repertoire of sequence variation on a single bead is limited to a predetermined 
subset of the allowed universe of sequence variation for the entire library. 

Such a library is formed by stepwise synthesis of the peptides on the beads by a protocol which
includes one or more "structured random" addition cycles. (Optionally, one or more "unstructured random",
or "nonrandom", cycles may be utilized as well.)

A "structured random" cycle is one in which a variable residue is added, but some degree of control is
exercised as to which beads receive which amino acids of the set. A "nonrandom" cycle is one in which all
growing peptides of the library are reacted with a pure amino acid addition reagent. An "unstructured
random" cycle is one in which they are all reacted with a "mixed amino acid addition reagent", the mixture
including all amino acids belonging to the set defined for that variable residue position.

In the simplest form of "structured random cycle," the beads are divided into N aliquots, where N is the
number of amino acids in the set of that variable residue. Each aliquot receives a different one, and only
one, of those N different amino acids. As a result, all peptides on a bead in a given aliquot have the
identical amino acid at the variable residue position in question. This is called a "fully structured" cycle.

There are circumstances, however, when it is appropriate to react each aliquot with a mixture of a
unique subset of the amino acids in the set of the variable residue in question. For example, the set for the
residue position, considering the library as a whole, may be 100 amino acids. The beads may be divided
into aliquots A, B, C and D, which are reacted with mixtures A' (amino acids 1-25), B' (amino acids 26-50),
C' (amino acids 51-75) and D' (amino acids 76-100), respectively. This is an example of a "partly structured" cycle.
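
A structured cycle can be pictured as a divide, couple and pool step. The Python sketch below simulates one fully structured cycle followed by one partly structured cycle. Everything in it is an assumption made for illustration: the function name `structured_cycle`, the representation of a bead as a list recording the reagent it received at each cycle, the random shuffling used to form aliquots, and the use of ID numbers 1-100 to stand for amino acids.

```python
import random

def structured_cycle(beads, subsets, rng):
    """Divide beads into len(subsets) aliquots, couple one reagent per aliquot, pool.

    beads:   list of beads; each bead is a list of the reagents it has received.
    subsets: one tuple of amino acids per aliquot (a single-element tuple gives a
             fully structured cycle; larger tuples give a partly structured one).
    """
    shuffled = beads[:]
    rng.shuffle(shuffled)
    n = len(subsets)
    pooled = []
    for i, subset in enumerate(subsets):
        aliquot = shuffled[i::n]                     # every n-th bead forms one aliquot
        for bead in aliquot:
            pooled.append(bead + [tuple(subset)])    # record the reagent this bead saw
    return pooled

rng = random.Random(0)
beads = [[] for _ in range(12)]

# Fully structured cycle: three aliquots, one pure amino acid each.
beads = structured_cycle(beads, [("Ala",), ("Gly",), ("Ser",)], rng)

# Partly structured cycle: four aliquots, each reacted with a 25-member mixture.
mixtures = [tuple(range(1, 26)), tuple(range(26, 51)),
            tuple(range(51, 76)), tuple(range(76, 101))]
beads = structured_cycle(beads, mixtures, rng)

for bead in beads[:3]:
    print(bead[0], "then a mixture of", len(bead[1]), "amino acids")
```

Each bead's history shows a single amino acid at the fully structured position and a 25-member subset at the partly structured position, which is the distinction the text draws between the two kinds of cycle.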

The number of different aliquots to which a bead may be assigned during a particular structured cycle
is the "partitioning factor" for that cycle. The number of different permutations of aliquot assignments which
an individual bead may experience as the library is synthesized is the "library partitioning factor", the
product of the partitioning factors for the individual cycles (the partitioning factor for an unstructured cycle is
unity). The expected number of beads in the library that will have been subject to a particular sequence of
aliquot assignments in the partitioning steps (B_L') is the total number of beads in the library (B_L), divided by
the library partitioning factor. B_L' is preferably at least one, more preferably at least two, still more
preferably at least ten.

If there are 10^7 beads in the library (B_L) and there are three structured cycles, each with a partitioning
factor of 100, the library partitioning factor is 10^6, and B_L' is 10. If four structured cycles were employed, a
partitioning factor of 100 per cycle would be too high; a factor of about 50 would be acceptable (50^4 = 6.25
x 10^6). If a larger number of beads could be screened, the library partitioning factor could be increased.
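
The partitioning arithmetic can be checked with a short Python sketch. The function names are hypothetical; the definitions follow the text (library partitioning factor as the product of the per-cycle factors, and the expected number of beads per assignment sequence as the bead count divided by that product). The helper `largest_uniform_factor` is an added convenience, not something described in the text.

```python
from math import prod

def library_partitioning_factor(cycle_factors):
    """Product of the per-cycle partitioning factors (unity for unstructured cycles)."""
    return prod(cycle_factors)

def expected_beads_per_assignment(total_beads, cycle_factors):
    """Expected number of beads sharing one sequence of aliquot assignments."""
    return total_beads / library_partitioning_factor(cycle_factors)

def largest_uniform_factor(total_beads, cycles, min_beads=1):
    """Largest per-cycle factor f such that total_beads / f**cycles >= min_beads."""
    f = 1
    while (f + 1) ** cycles * min_beads <= total_beads:
        f += 1
    return f

# Three structured cycles of 100 aliquots each, 10^7 beads.
print(expected_beads_per_assignment(10**7, [100, 100, 100]))

# Four structured cycles of 100 aliquots each leave fewer than one bead per assignment.
print(expected_beads_per_assignment(10**7, [100] * 4))

# Largest equal per-cycle factor that still leaves at least one bead per assignment.
print(largest_uniform_factor(10**7, 4))
```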

The expected number of identical peptide molecules on a single bead (M_B') is equal to the expected
number of peptide molecules on the bead (M_B) divided by the expected diversity of the bead (D_B). The
diversity factor (D_B) for the bead is the product of the diversity factors for that bead for each residue of the
peptide. In an unstructured random cycle, the cycle's diversity factor is the same for both the bead and the
library, i.e., the number of different amino acids in the corresponding reagent. In a structured random cycle,
the cycle's diversity factor for a bead is the number of different amino acids in the reagent reacted at that
time with that bead (for a fully structured cycle, it is unity).
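
A minimal Python sketch of the per-bead calculation follows. It assumes each cycle is summarized by the number of different amino acids in the reagent reacted with the bead in that cycle (1 for a nonrandom or fully structured cycle); the function names are illustrative only.

```python
from math import prod

def bead_diversity(reagent_sizes):
    """D_B: product of the per-cycle diversity factors seen by one bead."""
    return prod(reagent_sizes)

def identical_molecules_per_bead(molecules_per_bead, reagent_sizes):
    """M_B' = M_B / D_B."""
    return molecules_per_bead / bead_diversity(reagent_sizes)

M_B = 6 * 10**13   # molecules per bead, the value assumed in the text's examples

# Two fully structured cycles (factor 1 each) and four unstructured cycles of 100.
cycles = [1, 1, 100, 100, 100, 100]
print(bead_diversity(cycles))
print(identical_molecules_per_bead(M_B, cycles))

# A partly structured cycle with 25-member mixtures contributes a factor of 25.
print(bead_diversity([25, 1, 100]))
```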

The number of peptide molecules which may be carried by a single bead is a function of the surface
area of the bead, and the number of potential simultaneous peptide attachment sites on that surface. The
method of the present invention requires that this number (M_B) be at least two (which would be technically
feasible only if a single peptide molecule could be detected and sequenced, and which would allow only
two different sequences per bead). For practical reasons, M_B is at least 10^2, preferably at least 10^3, more
preferably at least 10^6, still more preferably at least 10^9, even more preferably at least 10^12. Examples 1-4
assume a value of 6 x 10^13 molecules/bead.

The number of beads in the library (B_L) is limited only by the number of beads which may be screened.
Preferably, at least 10^7 beads, more preferably 10^8 or 10^9 beads, are screened. It is likely that mechanical
assistance would be required to effectively screen a larger number of beads in a single library.

The number of binding peptides which must be carried by a single bead for the binding assay to be
able to determine whether those molecules specifically bind the affinity reagent is a function of both the
degree reagent, and the sensitivity of the assay. Preferably, the assay requires no more than 10^7, more
preferably no more than 10^6, binding molecules per bead for identification. Also, it is preferable that no more
than ten, more preferably no more than two, still more preferably no more than one such bead is needed
for detection.

The maximum potential diversity of the peptide library is a function of

(a) the number of peptide molecules which may be carried by a single bead,

(b) the number of beads which may be screened,

(c) the number of binding peptide molecules which must be carried by a single bead for the binding
assay to be able to determine whether those molecules specifically bind the affinity reagent,

(d) the number of at least partially identical peptide molecules which must be carried by a single bead
for the common portion of their amino acid sequence to be sequenceable, and

(e) the required level of statistical confidence that essentially all theoretically synthesized peptides are
actually present in detectable and sequenceable amounts.

It will be recognized that the person of ordinary skill will take advantage of advances in the
binding assay, peptide synthesis, and peptide sequencing arts so as to achieve a higher level of 
diversity in the library. Consequently, while for the purpose of calculations demonstrating the 
feasibility of the present invention, it may be assumed that 10^6 - 10^8 beads may be screened
manually, that 6 x 10^13 peptide molecules may be packed on a single bead, that 10^6 molecules are
required for detection of binding, and that 10 pmoles of peptide (about 6 x 10^12 hexapeptide
molecules) are required for sequencing, these limitations should not be imposed on the scope of the 
present invention if they become technologically obsolete.

The first limitation on the diversity of the library is the number of peptide molecules in it. This is equal
to the number of beads in the library, multiplied by the number of molecules per bead. Thus, if there are
10^7 beads, and 6 x 10^13 peptides per bead, there are 6 x 10^20 peptide molecules in the library.

The second limitation is imposed by the detection technology. For detection to occur, there must be
one or more beads each of which bears a minimum number of identical peptide molecules which have the
desired binding property. For example, if the detection technology requires one positive bead, and at least
10^6 molecules on that bead which have the appropriate sequence, the maximum permissible diversity of
the library is (6 x 10^20 / 10^6 =) 6 x 10^14 different peptide sequences. Thus, such a library might be a
pentapeptide library, with 500 different amino acids at each position (500^5 < 6 x 10^13), a hexapeptide library,
with nearly 200 different amino acids at each position (200^6 = 6.4 x 10^13), or an octapeptide library with over
40 different amino acids at each position (40^8 = 6.6 x 10^12).
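
The detection-limited bound on diversity, and the set size it permits for a given peptide length, can be sketched in Python as follows. The function names and the `positive_beads` parameter are assumptions for this illustration; integer arithmetic is used so that the results are exact.

```python
def max_diversity(beads, molecules_per_bead, detection_limit, positive_beads=1):
    """Upper bound on distinct sequences, given that each sequence needs
    detection_limit identical molecules on each of positive_beads beads."""
    return beads * molecules_per_bead // (detection_limit * positive_beads)

def largest_set_size(diversity_limit, length):
    """Largest number n of amino acids per position with n**length <= diversity_limit."""
    n = 1
    while (n + 1) ** length <= diversity_limit:
        n += 1
    return n

limit = max_diversity(10**7, 6 * 10**13, 10**6)
print(limit)

# Largest uniform set size the bound permits at several peptide lengths.
for length in (5, 6, 8):
    print(length, largest_set_size(limit, length))

# The set sizes quoted in the text all fall within the bound.
print(500**5 <= limit, 200**6 <= limit, 40**8 <= limit)
```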

The diversity of a single bead is also limited. If there are 6 x 10^13 attachment sites, and 10^6 identical
peptide molecules are required for detection, the maximum bead diversity is 6 x 10^7.

Each unstructured random cycle increases the diversity of a single bead, as well as of the library, by its
diversity factor. Each fully structured random cycle increases the diversity of the library, but not the
diversity of the bead. A bead diversity limit of 6 x 10^7 would be approached by three unstructured cycles of a
little less than 400 amino acids each, four unstructured cycles of almost 90 amino acids each, and so forth.

In the example above, there would be no advantage to adjusting the relative number of structured and
unstructured cycles. With two structured and four unstructured cycles of 100 AAs each, there would only be
(6 x 10^13 / 100^4 =) 6 x 10^5 molecules of each peptide sequence on each bead, below the assumed detection
limit of 10^6. With four structured and two unstructured cycles of 100 AAs each, there would be 100^4 (= 10^8)
different permutations of bead partitions, but only 10^7 beads, so that the expected number of beads
subjected to a given series of four aliquot assignments would be only 0.1, not at least 1.0 as is desirable.
However, if our underlying assumptions are changed, the relative merits of structured and unstructured
cycles also change. If, for example, the peptide density on the bead were higher, or the detection limit
lower, the number of unstructured cycles could be increased. If the number of beads were higher, more
structured cycles would be feasible. And finally, if fewer amino acids were used in each cycle, there could
be more cycles, structured or unstructured.
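
The trade-off described above can be explored by scanning every split of a fixed number of cycles between structured and unstructured. The Python sketch below assumes that all structured cycles are fully structured and that every cycle uses the same set size; the function name `feasible_splits` and its parameters are hypothetical.

```python
def feasible_splits(total_cycles, set_size, beads, molecules_per_bead,
                    detection_limit, min_beads_per_assignment=1):
    """Return (structured, unstructured) splits meeting both constraints:
    enough identical molecules per bead, and enough beads per assignment."""
    ok = []
    for structured in range(total_cycles + 1):
        unstructured = total_cycles - structured
        # Unstructured cycles dilute each sequence on the bead.
        enough_copies = molecules_per_bead >= detection_limit * set_size ** unstructured
        # Structured cycles multiply the number of aliquot assignment sequences.
        enough_beads = beads >= min_beads_per_assignment * set_size ** structured
        if enough_copies and enough_beads:
            ok.append((structured, unstructured))
    return ok

# Six cycles of 100 amino acids, 10^7 beads, 6 x 10^13 molecules per bead, limit 10^6.
splits = feasible_splits(6, 100, 10**7, 6 * 10**13, 10**6)
print(splits)

# With more beads available, an additional structured cycle becomes feasible.
print(feasible_splits(6, 100, 10**9, 6 * 10**13, 10**6))
```

Under the stated assumptions only the three-and-three split satisfies both constraints, and raising the bead count admits a further structured cycle, in line with the discussion above.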

The amount of peptide required for sequencing by present technology is 10 pmoles, which corresponds
to about 6 x 10^12 molecules (hexapeptides). If the diversity on a single bead were limited to that required for
sequencing the entire peptide at once, the approach would be of marginal value. With 6 x 10^13 molecules per
bead, the diversity would be limited to (6 x 10^13 / 6 x 10^12 =) 10. However, the present method contemplates
that only a partial sequence is determined initially.
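
The conversion from picomoles to molecules uses Avogadro's number. A brief Python sketch, with hypothetical function names, reproduces the figures in the preceding paragraph:

```python
AVOGADRO = 6.02214076e23  # molecules per mole

def pmol_to_molecules(pmol):
    """Convert an amount in picomoles to a number of molecules."""
    return pmol * 1e-12 * AVOGADRO

def full_sequencing_diversity_limit(molecules_per_bead, pmol_required):
    """Bead diversity allowed if every sequence must be present in a
    fully sequenceable amount."""
    return molecules_per_bead / pmol_to_molecules(pmol_required)

print(pmol_to_molecules(10))                           # about 6 x 10^12 molecules
print(full_sequencing_diversity_limit(6 * 10**13, 10)) # about 10 sequences per bead
```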

Thus, the peptides of the initial library consist of a first familial portion and of a second individual
portion. The first portion, which usually comprises one to five, preferably three amino acids, is common to
(or of limited variability among) all peptides on a given bead. The remainder of the peptide sequence is the
portion which fully or primarily distinguishes it from the different peptide sequences carried by the same
bead.

In the subsequent sublibraries, each peptide may be characterized as having a first portion which is
"universal," i.e., possessed by all peptides in that sublibrary, a second portion which is familial to all
peptides on a single bead of that sublibrary, and a third, individual portion. It is only necessary that each
possible residue of the familial subsequence on the active bead be present in a sequenceable amount. If
there are 100 pmole per bead, and 10 pmole is sequenceable, there could be up to ten different amino
acids (each at 10 pmole) in a given residue position among the peptides on a single bead. If the residues of
the familial subsequence are variable residues, several secondary libraries will be studied to determine
which of the familial subsequences belonged to an active peptide of the primary library.

During synthesis of the familial portion of the peptide, in each cycle in which a variable residue is to be
added, the beads are divided into N aliquots, where N is the number of amino acids in the set of that
variable residue, i.e., the number of different amino acid reagents used in that cycle if the cycle is fully
structured. Each aliquot of beads is reacted with an amino acid reagent providing one, and only one, of the
amino acids of the set. This is conveniently done in N different reactors. The aliquots are then pooled. More
typically, an encoding factor of two is used, so each aliquot is reacted with a mixture of two different amino
acid reagents.

If, however, the variable residue to be added is within the individual portion of the peptide, a mixture of
all the amino acids of the set is added to all of the beads.

A library may include peptides of different lengths. Such a library may be constructed by modifying one 
or more structured cycles so that one of the aliquots is not reacted with an amino acid. Alternatively, in any 
random cycle, the reagent may comprise a mixture of amino acids and oligopeptides. Either way, a library 
may be formed having peptides of different lengths but with a common familial portion. 

Beads

The term "beads" is not intended to be limited to spherical particles, but includes any small, discrete,
solid elements upon which a structured peptide library may be synthesized and screened. Thus, the
"beads" must be formed of a material with which peptides can be conjugated, and which is not
substantially bound by the affinity reagent. In addition, the beads must be capable of being divided into
aliquots and pooled back together, as described above, and of being separated later, during the screening
process, according to the ability of their conjugated peptides to bind an affinity reagent.

Preferably, the beads are made of aminomethylated polystyrene crosslinked with divinyl benzene. Other
potentially suitable materials include Tentagel (polyethyleneglycol modified polystyrene crosslinked with
divinyl benzene). The suitability of other support materials for use in the present invention may be evaluated
against the following criteria:

a. The ability to synthesize peptides on the beads: The beads should be stable in all the solvents used
in the peptide synthesis.

b. They should contain a free amino group, or a suitable stable but cleavable linker. However, it should
be noted that a cleavable linker is not required.

c. The beads should be mechanically stable during synthesis, screening and handling.

d. The size of the beads should be large enough to allow manual handling, or whatever alternative
handling means is contemplated.

e. The peptide capacity of the bead should be at least 10 pmole of peptide per bead, or whatever lower
limit is rendered feasible by advances in sequencing technology. A capacity of about 100 pmole is
preferable.

f. The beads should display a low degree of non-specific adsorption of ligands of choice and of proteins
in general.

(These criteria should not be considered absolute requirements.)

Beads which may be tested for suitability include:

Amino methyl PERSEPTIVE beads (Perseptive, Cambridge, Massachusetts, USA).

Beads based on the polymer TSK gel (TosoHaas, Stuttgart, Germany).

Matrix based upon FastFlow Sepharose (Pharmacia, Uppsala, Sweden).

The number of peptide molecules which may be placed on a given bead is a function of the surface
area of the bead and of the number of reactive sites per unit area. While there is no definite lower limit on
carrying capacity, the number of peptide molecules per bead is one of the factors limiting the potential
diversity of the library which can be reliably screened (i.e., so that one may with reasonable confidence
assume that all of the peptides which were theoretically expected to be produced were in fact produced in
detectable amounts, and, if screening were negative, assert that none of the expected peptides had the
desired affinity for the target). Nor is there a definite upper limit on carrying capacity; however, if the
reactive sites are too closely spaced, it is possible that there would be steric hindrance of binding.
Preferably, the bead has a diameter of 50 to 500 microns (e.g., 100 microns). The bead is preferably able
to carry at least 25 pmole peptide, more preferably at least 100 pmole peptide.

Since the packing efficiency (beads per unit volume of reactor) decreases with increasing bead
diameter, it is desirable to use smaller beads. The number of beads per unit volume of reactor is inversely
proportional to the volume of each bead, assuming that all the beads in question are spherical. The
volume of each bead is proportional to the third power of the bead diameter. The beads are porous, and the
peptide is expected to be synthesized throughout the bead volume. The capacity of the beads (the amount
of peptide on the bead) is therefore expected to be directly proportional to the bead volume and thus
directly proportional to the third power of the diameter. Thus, if the diameter of the bead is increased from
100 to 200 microns, it is expected that:

a. The number of beads per unit volume of the reactor would be 8 fold lower. 

b. The peptide capacity of each bead would be 8 fold higher. 

The total peptide capacity per reactor volume would therefore be constant. 
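
The scaling argument above can be checked with a short calculation. The following is a minimal sketch, assuming ideal spherical, porous beads whose capacity is proportional to volume; the function name `bead_scaling` is illustrative and does not come from the specification.

```python
def bead_scaling(d_ref_um: float, d_new_um: float):
    """Return (relative beads per reactor volume, relative capacity per bead)
    when the bead diameter changes from d_ref_um to d_new_um."""
    volume_ratio = (d_new_um / d_ref_um) ** 3  # bead volume scales with diameter cubed
    beads_per_volume = 1.0 / volume_ratio      # fewer beads fit in the same reactor volume
    capacity_per_bead = volume_ratio           # assumption: capacity tracks bead volume
    return beads_per_volume, capacity_per_bead


beads, capacity = bead_scaling(100.0, 200.0)
total_capacity = beads * capacity  # relative peptide capacity per reactor volume
print(beads, capacity, total_capacity)  # 0.125 8.0 1.0
```

Doubling the diameter gives one eighth as many beads, each with eight times the capacity, so the product is unchanged.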


Alternative Supports 

The structured oligomer libraries of the present invention may be adapted to libraries in which peptides 
are displayed on supports other than beads. However, the supports must be individually addressable, so 
that an individual support element displays a known family of oligomers, having a substantially common
subsequence whose sequence is determinable when the individual support element is examined. 

One example is an adaptation of the light-directed, spatially addressable parallel chemical synthesis
method of Fodor, et al., Science, 251:767 (1991). In Fodor's method, synthesis occurs on a solid support
sheet. The pattern of exposure to light (or other forms of energy) through a mask (or other spatially
addressable means) determines which regions of the support are activated for chemical coupling. This
activation results from the removal of photolabile protecting groups from the illuminated area. The support is
exposed to the addition reagent, which reacts only in the region which underlay the window of the mask.
The substrate is then illuminated through a second mask, with a different window. Combinatorial masking
strategies are used to form a large number of compounds in a small number of chemical steps. For
example, in the first round, the support surface may be divided into twenty vertical stripes, each of which (in
separate illumination-reaction cycles) receives one of the twenty genetically encoded amino acids. The
surface is then divided into twenty horizontal stripes, which are similarly treated in the second round. All
400 dipeptides are thereby synthesized. Obviously, by using 400 vertical stripes and 400 horizontal stripes,
all 160,000 tetrapeptides could be synthesized. A resolution of 50 microns was achieved by Fodor, et al.
(Their detection system was said to have a sensitivity limit of about 100 fluorescein molecules in a 10 square
micron region.)
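
The stripe-based masking strategy can be illustrated with a small sketch. It assumes a simple model in which each vertical stripe receives one residue in the first round and each horizontal stripe receives one residue in the second round; the function name `stripe_grid` and the one-letter amino acid codes are illustrative choices, not part of Fodor's published procedure.

```python
from itertools import product

# One-letter codes for the twenty genetically encoded amino acids.
AMINO_ACIDS = list("ACDEFGHIKLMNPQRSTVWY")


def stripe_grid(first_round, second_round):
    """Map each (column, row) synthesis area to the pair of residues coupled there.

    Round 1 treats vertical stripes (columns); round 2 treats horizontal stripes (rows).
    """
    grid = {}
    for (col, first), (row, second) in product(
        enumerate(first_round), enumerate(second_round)
    ):
        # Each area is identified by its coordinates alone, so the product
        # formed there is known without sequencing.
        grid[(col, row)] = (first, second)
    return grid


grid = stripe_grid(AMINO_ACIDS, AMINO_ACIDS)
print(len(grid))                # 400 synthesis areas
print(len(set(grid.values())))  # 400 distinct dipeptides
```

Forty coupling steps (twenty per round) thus yield 400 addressable products.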

The result of the combinatorial masking strategy is the formation of a large number of compounds, 
distributed across the support. However, in any given "synthesis area", i.e., a commonly treated area of the
support, one product predominates. 

In the present modification of Fodor's method, the entire support is subjected to one or more rounds of 
reaction with a mixture of amino acids. Then one or more additional amino acids are added to each growing 
peptide, using Fodor's method. The result is that within a synthesis area, one finds not a single peptide 
sequence, but rather a family of related peptides having a common amino terminal and a heterogeneous
carboxy terminal.

Unlike the "bead" embodiment, there is no need to sequence this amino terminal, as the sequence of 
the terminal is deducible from the coordinates of the assay-positive synthesis area. The active peptides 
within the known family may be determined by synthesizing a secondary library, corresponding to the 
family of the primary library, and assaying for binding. If a support is divided into 50 micron square
synthesis areas, each area is expected to display 100,000 - 1,000,000 peptide molecules. If 100 molecules
per 50 micron square synthesis area are required for detection, each such synthesis area may present 
1,000 - 10,000 different peptide sequences. Thus, our adaptation of Fodor, et al.'s method allows it to 
explore a 1,000 - 10,000 fold more diverse universe of peptides. 
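
The arithmetic behind the 1,000 - 10,000 fold figure is a simple division. A minimal sketch, using the molecule counts and detection requirement stated above (the function name `sequences_per_area` is illustrative):

```python
def sequences_per_area(molecules_in_area: int, molecules_needed: int) -> int:
    """Number of distinct sequences one synthesis area can present while
    keeping each sequence at or above the detection requirement."""
    return molecules_in_area // molecules_needed


low = sequences_per_area(100_000, 100)
high = sequences_per_area(1_000_000, 100)
print(low, high)  # 1000 10000
```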


Screening 

The peptide library is screened by exposing the peptide-bearing beads to an affinity reagent as
previously described. The reagent will become bound to beads bearing peptides having an affinity for the
reagent. Excess reagent is removed and a signal, e.g., fluorescence or a color change, is produced to
distinguish the interacting beads from the passive beads.

The interacting beads are then removed, either manually, or by other means which detect either the
presence of the reagent, or the generation of the signal. In one embodiment, a sorting reagent is employed
which comprises magnetic beads coupled to antibodies (or other binding molecules) which bind the affinity
reagent. For example, if the affinity reagent is a rabbit polyclonal antibody, the magnetic beads could be
coupled to goat anti-rabbit antibodies. Preferably, the magnetic beads are of a mass substantially lower than
that of the library beads, so as to reduce the risk of shearing the complex. However, the less massive the
bead, the greater the magnetic field required for efficient separation. Preferably the magnetic beads have a
mass of about 10^-11 g to 10^-13 g. The diameter of the magnetic beads matters as well. Large and light
magnetic beads would suffer higher shear forces as compared to small and compact beads having the same
mass. This is due to the larger drag forces exerted on the larger beads by fluid movements. Most of the
magnetic beads available on the market are composed of derivatives of polystyrene. They do not have similar
densities, since they vary in the amount of metal in the bead. The most advanced magnetic sorter available,
the MACS from Becton Dickinson, utilizes magnetic beads of about 0.1 micron. Becton Dickinson use a very
powerful magnet. Lesser machines, having less powerful magnets, use bigger beads. We believe, however,
that beads with about 1 micron diameter might be especially suitable.

Rather than use separate sorting and affinity reagents, it is possible to use a sorting reagent which 
mimics the target of interest and therefore binds the peptides directly. The "signal" is then the separation of 
the bead by the sorting reagent. 

Sequencing

"Positive" beads (those which carry binding peptides) are collected and individually sequenced. While a 
variety of sequencing techniques are known, all contemplate (a) cleaving off a single amino acid from one
end of the peptide, (b) collecting and identifying the released amino acid, and (c) repeating steps (a) and (b)
until the entire peptide has been sequenced, or further sequencing becomes impractical. Normally,
sequencing is performed only on homogeneous peptide preparations. However, while a single bead bears a
mixture of peptides, all of the peptides of that mixture have a common terminal portion, the "familial"
portion, which is sequenceable.

The "familial" portion of the peptides of the library may be the amino terminal portion, in which case the
sequential degradation must begin at the amino terminal, or it may be the carboxy terminal portion, in which
case sequencing begins at the carboxyl end of the peptide.

The primary sequence of amino acids in a peptide or protein is commonly determined by a stepwise
chemical degradation process in which amino acids are removed one-by-one from one end of the peptide,
and identified. In the Edman degradation, the N-terminal amino acid of the peptide is coupled to
phenylisothiocyanate to form the phenylthiocarbamyl (PTC) derivative of the peptide. The PTC peptide is
then treated with strong acid, cyclizing the PTC peptide at the first peptide bond and releasing the
N-terminal amino acid as the anilinothiazolinone (ATZ) derivative. The ATZ amino acid, which is highly
unstable, is extracted and converted into the more stable phenylthiohydantoin (PTH) derivative and
identified by chromatography. The residual peptide is then subjected to further stepwise degradation.

For carboxy terminal sequencing (of peptides synthesized with their amino terminal coupled to a 
support), the cleavage reagent may be a carboxy peptidase. 

The present invention is not limited to any particular method of sequencing; however, the method
chosen must be reasonably capable of identifying the sequence of the familial portion of the family of
peptides on a single bead.
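
Why the familial portion of a mixed bead remains sequenceable can be seen in a toy simulation of stepwise N-terminal degradation. This is a sketch only: it assumes every peptide on the bead is present in equal amount and that each cycle releases exactly one residue from every chain. The function name `degrade_family` and the example sequences are hypothetical.

```python
from collections import Counter


def degrade_family(peptides, cycles):
    """Simulate stepwise N-terminal degradation of a peptide mixture.

    Returns one dict per cycle giving the fraction of each residue released.
    """
    results = []
    for i in range(cycles):
        released = Counter(p[i] for p in peptides if len(p) > i)
        total = sum(released.values())
        if total == 0:
            results.append({})  # every chain has already been fully degraded
            continue
        results.append({aa: n / total for aa, n in released.items()})
    return results


# Hypothetical bead: residues 1-3 are familial ("AKD"), residues 4-6 vary.
family = ["AKDGLW", "AKDPSY", "AKDTFV", "AKDGSV"]
profile = degrade_family(family, 4)
print(profile[0])  # {'A': 1.0}  a single clean signal in a familial position
print(profile[3])  # {'G': 0.5, 'P': 0.25, 'T': 0.25}  a mixed signal
```

The first three cycles each give one unambiguous residue, while the fourth cycle gives a mixture whose individual signals are diluted by the bead diversity.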

Sequencing of branching peptides 

The N-terminal sequencing procedure is standard. The difficulty is in identification of the correct
structure. If we know that a certain peptide is branched and we know the degree of branching (biantennary,
triantennary, etc.) and its location relative to the sequence, we would be able to deduce the correct structure
based upon data collected from the N-terminal sequencing and our knowledge of the secondary synthesis
approach. However, if we allow a library to contain both linear and branched peptides, or if in a branched
library we allow random branching, it would be very difficult to deduce the structure based solely upon
sequence information coupled with a limited number of secondary libraries.

The best approach would probably be to design independent linear or branching libraries and to design 
each branching library with a single architecture. 

The utilization of the full potential of the branching approach would be dependent upon use of
N-terminal orthogonal protection of each branch. Otherwise, many possible structures would fail to appear
within the library.

Sequencing of cyclic peptides 

Sequencing of cyclic peptides produced by intra-chain cystine formation is straightforward, following
reduction of the disulfide bonds. With some forms of cyclization, determination of the N-terminal sequence by
Edman degradation is not possible, and thus the cyclization approach would have to be accompanied by
an encoding procedure.

Encoding 


Certain amino acids (e.g., serine, histidine) are difficult to identify by the standard procedures, either
because the amino acids are destroyed by the Edman degradation procedure or because adequate references for
their identification are not available. This problem may be overcome by any of several means:

(a) increasing the proportion of the difficult amino acids in mixed amino acid reagents used in the course
of peptide synthesis;

(b) use of more than one sequencing procedure; an amino acid which is difficult to analyze by one
procedure may be easier to detect by another;

(c) labeling difficult amino acids with a detectable but non-interfering label prior to adding them to the
nascent peptide during peptide synthesis; and/or

(d) in the substitution set for a given residue position in the peptides of a single bead, partnering each
difficult-to-sequence amino acid with a readily sequenceable amino acid ("encoding").

Alternative (a), while conceivably workable, would create an imbalance in the structure of the intended
library.

Alternative (b) requires splitting the bead in two and subjecting each half to a different sequencing
procedure, since these procedures are destructive.

Alternative (c) is practical if the label is non-interfering and is not modified adversely by the synthesis or
sequencing procedure.

Alternative (d) is preferred, and deserves further explanation. Consider a bead in a peptide library which
bears peptides having the sequence

F1 - F2 - F3 - I4 - I5 - I6,

where Fn are "familial" residues and In are "individual" residues. Under normal usage, the "F" residues are
unique for a given bead. Suppose, however, that one or more of Fn are difficult-to-sequence amino acids,
such as tryptophan. If so, then the sequencing of the peptides on this bead will not yield useful results, as
the amino acid in that position would be undetermined. While one could exhaustively test all the
possibilities, there is an alternative.

Suppose that only F1 was tryptophan. One could instead structure the library so that on this bead, F1
was either tryptophan or an easy-to-sequence amino acid, such as glycine. If this bead were found to be
positive, amino-terminal sequencing would reveal the sequence

Gly - F2 - F3

for the first three amino acids of the peptides on the bead. The actual binding peptide could be Gly - F2 - F3,
or it could be Trp - F2 - F3. The answer would be determined by screening a secondary library which
corresponds to the family of peptides found on this positive bead.

A typical usage of "encoding" would be in screening D-amino acids, as conventional sequencing does
not distinguish D- and L-forms of the same amino acid. Technically it is possible to separate D and L amino
acids using a suitable column, or to determine their optical nature in a chirally pure solution.
However, all of the known methods need much larger amounts of material for analysis than are obtained
when sequencing a single bead. Thus, it is impractical to determine the chiral identity of the sequenced
amino acids from individual library beads. Another use for "encoding" would be to distinguish Glu from Gln,
or Asp from Asn.

Thus, according to our strategy we incorporate in the "familial" positions (e.g., 1-3 from the N-terminus)
of the primary library, two amino acids. The amino acid pairs are selected so that each "difficult" amino
acid is paired with an "easy" one. When sequencing is performed, the signal generated by the "easy"
amino acid indicates the existence of the "difficult" one even though it is not registered by the sequencer.
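
The decoding step implied by this pairing strategy can be sketched as follows. The Gly/Trp pairing comes from the example above; the Ala/Ser pairing, the table name `ENCODING`, and the function name `candidate_families` are hypothetical and serve only to illustrate the bookkeeping.

```python
from itertools import product

# Pairing table: observed easy-to-sequence residue -> residues it may stand for.
# Gly/Trp follows the example in the text; Ala/Ser is a hypothetical second pair.
ENCODING = {
    "Gly": ("Gly", "Trp"),
    "Ala": ("Ala", "Ser"),
}


def candidate_families(observed):
    """List every familial sequence consistent with the sequencer read-out."""
    # A residue with no entry in the table is assumed to be unpaired.
    options = [ENCODING.get(res, (res,)) for res in observed]
    return [" - ".join(combo) for combo in product(*options)]


candidates = candidate_families(["Gly", "Leu", "Val"])
print(candidates)  # ['Gly - Leu - Val', 'Trp - Leu - Val']
```

Each encoded position doubles the number of candidates, and it is this short list that the secondary library must resolve.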

In short peptides, the N-terminal might influence the activity of the peptide; e.g., when the peptide is
composed of mostly hydrophobic amino acid residues, the hydrophilic primary amine of the N-terminal
might disrupt the peptide activity. However, it is not possible to sequence N-terminal blocked peptides by
Edman degradation.

The encoding strategy may be used to analyze the structure of N-terminal modified peptides. In the
final synthesis cycle, we use a mixture of FMOC protected (α-amine blocked by FMOC) and blocked
(α-amine blocked with acetic or benzoic acid) amino acids. Upon sequencing, the signal obtained from the
N-terminal amino acid implies the existence of the blocked N-terminal amino acid, which is stable to the
Edman degradation.

Non-Peptide Libraries 

Polymers other than peptides may be used in the structured libraries of the present invention, provided
(a) they can be synthesized in a manner permitting "structuring", (b) when so presented, they are bindable
by a target material, and (c) the polymer molecules on a single bead may be sequenced, at least partially.
Suitable polymers include peptoids, nucleic acids, and carbohydrates.

It should be noted that if a particular type of polymer cannot be sequenced readily, it can be studied
indirectly by means of an "encoding" strategy in which the beads carry both peptides and the non-peptide
polymer. Cf. Brenner and Lerner, "Encoded Combinatorial Chemistry," PNAS (USA), 89:5381-83, (1992),
who disclose chemically linking a "genetic tag" (amplifiable by PCR) to a polymer which is not itself
genetically encodable. The peptide need not, however, be chemically linked to the non-peptide polymer in
the present method. For example, a library may be structured so that on a given bead, there is a single
peptide sequence, and a family of related nucleic acid sequences. Sequencing the peptide then identifies
the nucleic acid family.

It may be desirable to present difficult-to-sequence polymer libraries by our adaptation of the
light-directed, spatially addressable parallel chemical synthesis method of Fodor, et al., as the familial sequence
is then indicated by the spatial address.

Peptoids are peptide analogues in which the peptide bond (-NHCO-) is replaced by an analogous
structure, e.g., -NRCO-. See Simon, et al., "Peptoids: A modular approach to drug discovery," Proc.
Natl. Acad. Sci. USA 89:9367-9371, 1992, and cf. Gilon, et al., "Backbone cyclization: A new method for
conferring conformational constraint on peptides," Biopolymers 31:745-750, 1991. When the polymer is a
peptoid, it may be synthesized as described in the Reference Example. In general, these peptoids are
sequenceable just as peptides are, though the sensitivity or accuracy may be different.

When the polymer is a nucleic acid, conventional DNA or RNA synthesis and sequencing methods may
be employed. The usual bases are the purines adenine and guanine, and the pyrimidines thymine (uracil
for RNA) and cytosine. However, unusual bases, such as those listed below, may be incorporated into the
synthesis or produced by post-synthesis treatment with mutagenic agents.

4-acetylcytidine
5-(carboxyhydroxylmethyl)uridine
2'-O-methylcytidine
5-carboxymethylaminomethyl-2-thiouridine
5-carboxymethylaminomethyluridine
dihydrouridine
2'-O-methylpseudouridine
beta,D-galactosylqueosine
2'-O-methylguanosine
inosine
N6-isopentenyladenosine
1-methyladenosine
1-methylpseudouridine
1-methylguanosine
1-methylinosine
2,2-dimethylguanosine
2-methyladenosine
2-methylguanosine
3-methylcytidine
5-methylcytidine
N6-methyladenosine
7-methylguanosine
5-methylaminomethyluridine
5-methoxyaminomethyl-2-thiouridine
beta,D-mannosylqueosine
5-methoxycarbonylmethyluridine
5-methoxyuridine
2-methylthio-N6-isopentenyladenosine
N-((9-beta-D-ribofuranosyl-2-methylthiopurine-6-yl)carbamoyl)threonine
N-((9-beta-D-ribofuranosylpurine-6-yl)N-methylcarbamoyl)threonine
uridine-5-oxyacetic acid methylester
uridine-5-oxyacetic acid (v)
wybutoxosine
pseudouridine
queosine
2-thiocytidine
5-methyl-2-thiouridine
2-thiouridine
4-thiouridine
5-methyluridine
N-((9-beta-D-ribofuranosylpurine-6-yl)carbamoyl)threonine
2'-O-methyl-5-methyluridine
2'-O-methyluridine
wybutosine
3-(3-amino-3-carboxypropyl)uridine






EP 0 639 584 A1 

DNA may be synthesized by the stepwise addition of nucleotides to a nascent chain. The first step of
the synthesis may be the coupling of a nucleoside, via a succinyl linkage, to a suitable support, such as
cellulose. This nucleoside represents the 3' end. Chain elongation proceeds from 3' to 5', each cycle being
composed (in one conventional method) of the following steps:

(1) Selective deprotection

For example, if the 5'-hydroxyl is protected by a dimethoxytrityl group, it is removed with acid.

(2) Condensation

A protected nucleotide is coupled to the exposed 5' end. The protected nucleotides may be
5'-O-dimethoxytrityl-N6-(benzoyl)-2'-deoxyadenosine, 5'-dimethoxytrityl-N4-(anisoyl)-2'-deoxycytidine,
5'-O-dimethoxytrityl-N6-(N',N'-di-n-butylformamidine)-2'-deoxyadenosine, and
5'-O-dimethoxytrityl-N2-(propionyl)-O-(diphenylcarbamoyl)-2'-deoxyguanosine.

(3) Capping

Unreacted 5'-hydroxyl groups are protected, e.g., by acylation.

The traditional method for DNA sequencing by chemical cleavage depends on the parallel execution of
four base-specific or base-selective modification protocols and the parallel electrophoretic resolution of the
hydrolysates in four lanes. It is also possible to analyze DNA based on a single base modification
procedure, if it produces some degree of backbone cleavage at all bases in the DNA but the rates of
cleavage at the four canonical bases (A, T, G, C) are clearly different. See Ambrose and Pless, Meth.
Enzymol. 152:522 (1987) (modification with 0.5M aqueous piperidine, 0.3M NaCl, 90 °C, pH > 12, 5 hrs).
The single reagent method is faster but less accurate.

Polysaccharides are larger polymers of monosaccharides in a branched or unbranched chain. Oligosac- 
charides are shorter polymers of monosaccharides, such as di-, tri-, tetra-, penta-, and hexasaccharides. For 
the sake of convenience, the term "polymeric carbohydrate" will be used to cover both poly- and 
oligosaccharides. 

Monosaccharides in a polymeric carbohydrate library may be aldoses, ketoses, or derivatives. They
may be tetroses, pentoses, hexoses or more complex sugars. They may be in the D- or the L-form. Suitable
D-sugars include D-glyceraldehyde, D-erythrose, D-threose, D-arabinose, D-ribose, D-lyxose, D-xylose,
D-glucose, D-mannose, D-altrose, D-allose, D-talose, D-galactose, D-idose, D-gulose, D-rhamnose, and
D-fucose. Suitable L-sugars include the L-forms of the aforementioned D-sugars.

A sugar hemiacetal may be reacted with a hydroxyl group of another sugar to form a disaccharide, and
the reaction may be repeated. For carbohydrate synthesis methods, see Kanie, O. and Hindsgaul, O.,
"Synthesis of Oligosaccharides, Glycolipids and Glycopeptides," Curr. Opin. Struc. Bio., 2:674-681, (1992).
For sequencing, see Y.C. Lee, "Review: High-Performance Anion-exchange Chromatography for
Carbohydrate Analysis," Anal. Biochem. 189:151-162, (1990); Maley, F., Trimble, R.B., Tarentino, A.L., Plummer,
T.H., "Review: Characterization of Glycoproteins and Their Associated Oligosaccharides Through the Use of
Endoglycosidases," Anal. Biochem., 180:195-204, (1989); and Spellman, M.W., "Carbohydrate
Characterization of Recombinant Glycoproteins of Pharmaceutical Interest," Anal. Chem. 62:1714-1722, (1990).

Special constructs which have been described recently include: 

Hybrid Polypeptide/Nucleic Acids: In this type of amino acid derivative, the R group of the Cα is a
nucleotide residue. The backbone may be a regular polypeptide and thus we assume that there should be
no difficulty in synthesis and in Edman degradation. See Meier, C. and Engels, J.W., "Peptide nucleic acids
(PNAs) -- Unusual properties of nonionic oligonucleotide analogs," Angew. Chem. (Engl) 31:1008-1010,
1992; Egholm, M., Buchardt, O., Nielsen, P.E. and Berg, R.H., "Peptide nucleic acids (PNA).
Oligonucleotide analogs with an achiral peptide backbone," Journal of the American Chemical Society
114:1895-1897, 1992.

"Mixed" polymers of amino acids and other monomers:

A single chain comprising amino acids and other monomers (e.g., nucleic acids) may be prepared. 


Example 1 

In this example, we describe a hexapeptide library in which each residue is chosen from a set of 25
amino acids. The library has a maximum possible "diversity" of 25^6, or about 2.44 x 10^8 different peptide
sequences.

While, considering the library as a whole, each residue position of the hexapeptides may be any of the
25 residues of the set, the library is structured so that residues 1-3 (numbered from the amino terminal) are
familial residues, and residues 4-6 are individual residues. Unless "encoding" is needed, as explained
previously, for the peptide family on a single bead, the first three residues will be the same, the diversity
being apparent only in residues 4-6. (Of course, residues 1-3 will vary from bead to bead.)

This library is constructed as follows. A mixture is prepared of 25 different, amino-protected amino
acids. For three amino acid addition cycles, this mixture is reacted with all of the beads (i.e., an
unstructured random cycle, abbreviated as "UR" in Table 1). As a result, each bead has a multitude of
random tripeptides (positions 4-6 of the desired hexapeptides). Thus, there are 25^3 possible different
tripeptide sequences.

The remaining three cycles are structured random ("SR") cycles. In the fourth cycle, the beads are
divided into 25 aliquots. The first aliquot is reacted with L-Ala, the second with L-Arg, the third with L-Asp,
and so on through all 25 aliquots. We now have synthesized all 25^4 possible different tetrapeptide
sequences. However, the amino terminal amino acid of all of the peptides on the beads of the first aliquot is
L-Ala, while the amino terminal amino acid of all of the peptides of the second aliquot is L-Arg. (This amino
acid will be residue 3 of the final hexapeptide.) All of the beads are now mixed together randomly. In the
fifth cycle, the beads are once again divided into 25 aliquots, and each aliquot reacted with a particular
amino acid to yield pentapeptides on the beads. This is residue 2 of the final hexapeptide. The beads are
pooled and "shuffled," and, in the sixth and last cycle, divided into 25 aliquots to receive the final amino
acid (residue 1 of the hexapeptide). The beads of the library now bear all 25^6 possible hexapeptides, but
the synthesis has been structured so that residues 1-3 (counting from the amino terminal) of the peptides
are identical for the peptides on a given bead.
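
The structured cycles described above can be mimicked in a small simulation. This is a sketch under simplifying assumptions: only the three structured cycles are modeled (the unstructured tail of residues 4-6 is omitted), aliquots are formed by dealing shuffled beads round-robin, and a five-letter alphabet stands in for the 25 amino acids. The function name `structured_library` and all parameter values are illustrative.

```python
import random


def structured_library(n_beads, alphabet, structured_cycles, seed=0):
    """Return the familial sequence (read from the amino terminal) of each bead.

    Each structured cycle shuffles the pooled beads, divides them into
    len(alphabet) aliquots, and couples one residue to every bead of an aliquot.
    """
    rng = random.Random(seed)
    beads = [[] for _ in range(n_beads)]
    for _ in range(structured_cycles):
        rng.shuffle(beads)  # pool and mix before dividing
        for index, bead in enumerate(beads):
            # Round-robin dealing gives aliquots of nearly equal size.
            bead.append(alphabet[index % len(alphabet)])
    # Residues are added toward the amino terminal, so the last one coupled is residue 1.
    return ["".join(reversed(bead)) for bead in beads]


families = structured_library(n_beads=2000, alphabet="ABCDE", structured_cycles=3)
print(len(families), len(set(families)))  # number of beads, number of distinct familial sequences
```

Every bead ends up with exactly one familial sequence, and the number of distinct familial sequences cannot exceed the partitioning factor (here 5^3 = 125).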

The synthesis plan is summarized in the table below:

Table 1

Cycle Type   Cycle   Residue   DF(cycle)   PF(cycle)
SR           6       1         1           25
SR           5       2         1           25
SR           4       3         1           25
UR           3       4         25          1
UR           2       5         25          1
UR           1       6         25          1
overall                        25^3        25^3



The sequence set statistics for this library appear below:

           Size         Diversity              Sampling
bead       6 x 10^13    15,625 (25^3)          4 x 10^9
library    6 x 10^20    2.44 x 10^8 (25^6)     2.5 x 10^12



The safety factor analysis follows: 



Beads-in-Library   ÷   Library PF   ÷   Assumed Detection Limit   =   Beads-in-Library Safety Factor
10^7               ÷   15,625       ÷   1 bead per library        =   640

Sampling Level Per Bead   ÷   Assumed Detection Limit   =   Peptide-on-Bead Safety Factor
4 x 10^9                  ÷   10^6                      =   4 x 10^3
(molecules per                (binding molecules
sequence per bead)            per bead)

On a single bead, there are 4 x 10^9 identical molecules of each different hexapeptide sequence, which is
well above the assumed binding detection limit of 10^6 molecules. For the first three residue positions, there
are 100 picomoles of each amino acid, which comfortably exceeds the 10 picomoles assumed to be
required for sequencing.

25^3 (15,625) different beads would be required to have at least one bead for each of the 25^3 possible
permutations of the first three residue positions. If 10^7 beads are employed, there will be 10^7/25^3, or about
640, beads in the library bearing the identical familial sequence.
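
The figures for this example follow directly from the per-cycle factors in Table 1. A minimal sketch of that calculation is given here; the function name `library_statistics` and the dictionary keys are illustrative, while the inputs (per-cycle DF and PF values, 6 x 10^13 molecules per bead, 10^7 beads) are taken from the example.

```python
from math import prod


def library_statistics(cycle_df, cycle_pf, molecules_per_bead, beads_in_library):
    """Derive sequence set statistics from per-cycle diversity and partitioning factors."""
    bead_diversity = prod(cycle_df)  # distinct sequences sharing one bead
    library_pf = prod(cycle_pf)      # distinct familial sequences (bead types)
    return {
        "bead_diversity": bead_diversity,
        "library_pf": library_pf,
        "library_diversity": bead_diversity * library_pf,
        "sampling_per_bead": molecules_per_bead / bead_diversity,
        "beads_per_family": beads_in_library / library_pf,
    }


stats = library_statistics(
    cycle_df=[25, 25, 25, 1, 1, 1],  # cycles 1-3 unstructured, cycles 4-6 structured
    cycle_pf=[1, 1, 1, 25, 25, 25],
    molecules_per_bead=6e13,
    beads_in_library=1e7,
)
print(stats["library_diversity"])  # 244140625, i.e. about 2.44 x 10^8
print(stats["sampling_per_bead"])  # 3840000000.0, i.e. about 4 x 10^9
print(stats["beads_per_family"])   # 640.0
```

The same function applies to other synthesis plans by changing the two per-cycle lists.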

If one of the beads was detected as "positive" when the library was assayed, the peptide molecules on
it would be sequenced. All of them would have the same first three amino acids, in 100 pmole quantities,
which is sufficient for sequencing. Thus, these three amino acid positions could be determined. However, it
would not yet be known which of the 25^3 different peptide sequences on that bead was responsible for the
binding activity.

To find out, a secondary library is made. All peptides of this library have residues 1-3 in common. All
peptides on a given bead have residues 4-6 in common. If the assay on the secondary library "marks" a
bead, all of the peptides on this bead will have residues 1-6 in common, in 100 pmole quantities. Thus,
sequencing is feasible, and the active peptide is then fully identified.

It should be evident that larger active peptides may be determined by further iterative steps. 

Example 2 

In this example, a 10^7 bead library of all of the 100^6 hexapeptides formable from 100 different amino
acids is prepared.

Suppose 10^7 beads are subjected to three unstructured cycles, in each of which the beads are reacted
with a mixture of 100 different amino acids, and three structured cycles in each of which the beads are
divided into 100 aliquots, each aliquot is reacted with a single unique amino acid out of the set of 100
amino acids, and the aliquots are pooled back together after each cycle.

The synthesis plan this time is:

Table 2

Cycle Type   Cycle #   Residue #   Cycle DF   Cycle PF
SR           6         1           1          100
SR           5         2           1          100
SR           4         3           1          100
UR           3         4           100        1
UR           2         5           100        1
UR           1         6           100        1
overall                            10^6       10^6



This results in the following sequence set statistics:

           Size         Diversity   Sampling
Bead       6 x 10^13    10^6        6 x 10^7
Library    6 x 10^20    10^12       6 x 10^8



The relationship of the sequence set statistics to the varying detection limits may be expressed through
calculation of "safety factors", as follows:

beads-in-library   ÷   library PF   =   bead-in-library safety factor
10^7               ÷   10^6         =   10

sampling level per bead   ÷   assumed detection limit   =   peptide-on-bead safety factor
6 x 10^7                  ÷   10^6                      =   60



The derivation of these numbers is explained in greater detail below.

The diversity of this library (D_L) is 100^6 (10^12). If there are 10^7 beads in the library (B_L), and 6 x 10^13
molecules/bead (M_B), there are 6 x 10^20 molecules in the library (M_L). It is therefore possible to have
(6 x 10^20/10^12 = 6 x 10^8) identical molecules of each unique peptide sequence.

Each bead has a diversity of 100^3 (10^6). With 6 x 10^13 molecules on the bead, the expected number of
identical molecules per bead is (6 x 10^13/10^6 = 6 x 10^7), which is well above the 10^6 believed necessary for
detection. There are also 100^3 (10^6) different synthetic "paths" taken by the beads during the structured
cycles, and therefore the expected number of beads which underwent any given path is (10^7/100^3 = 10),
well above the desired minimum level of 1-2. Finally, 6 x 10^13 hexapeptide molecules per bead is the
equivalent of 100 picomoles/bead. Since all molecules on a single bead have the same first three amino
acids, this initial tripeptide is present in an amount of 100 picomoles, whereas 25 picomoles is deemed
desirable for sequencing.
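
The equivalence between 6 x 10^13 molecules and 100 picomoles can be verified with the Avogadro constant. A minimal sketch (the function name `molecules_to_pmol` is illustrative):

```python
AVOGADRO = 6.02214076e23  # molecules per mole (exact SI value)


def molecules_to_pmol(molecules: float) -> float:
    """Convert a count of molecules to picomoles."""
    return molecules / AVOGADRO * 1e12


pmol = molecules_to_pmol(6e13)
print(round(pmol, 1))  # 99.6, i.e. approximately 100 picomoles per bead
```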

Thus, the foregoing demonstrates the practicality of screening a hexapeptide library with each residue
chosen from a set of 100 different amino acids.



Example 3

Another way of synthesizing a library of all possible hexapeptides which could be prepared from 100 
different amino acids is by a combination of six partially structured random cycles. 

In cycles 1-3, residues 4-6 are provided. In each cycle, the beads are divided into four aliquots A, B, C
and D. Aliquot A is reacted with amino acid mixture A' (AAs 1-25), aliquot B with mixture B' (AAs 26-50), C
with C' (AAs 51-75), and D with D' (AAs 76-100). At the end of each cycle, the aliquots are repooled.

In cycles 4-6, residues 1-3 are added. In each cycle, the beads are divided into 50 different aliquots,
and each aliquot is reacted with a unique mixture of two of the 100 different amino acids. Where possible, a
difficult-to-sequence amino acid is paired with an easy-to-sequence amino acid.

This synthetic plan is summarized below:



Table 3

Cycle Type   Cycle   Residue   Cycle DF   Cycle PF
SR           6       1         2          50
SR           5       2         2          50
SR           4       3         2          50
SR           3       4         25         4
SR           2       5         25         4
SR           1       6         25         4
overall                        50^3       200^3



The sequence set statistics and library safety factors are as follows:

           Size         Diversity   Sampling Level
bead       6 x 10^13    125,000     4.8 x 10^8
library    6 x 10^20    10^12       6 x 10^8

Beads-in-Library   ÷   Library PF   =   Bead-in-Library Safety Factor
10^7               ÷   8 x 10^6     =   1.25

Sampling Level Per Bead   ÷   Assumed Detection Limit   =   Peptide-on-Bead Safety Factor
4.8 x 10^8                ÷   10^6                      =   480



If each bead takes up, in a given cycle, 100 pmoles of amino acid, residues 1-3 will be sequenceable,
as each of two possible amino acids will be present in an amount of (100/2 =) 50 pmoles (five-fold
more than the 10 pmoles assumed necessary for sequencing). The diversity per bead will be 2^3 x 25^3, or
125,000. The expected number of identical molecules of each sequence will be (6 x 10^13 / 125,000), or
about 5 x 10^8.

The partitioning factor of the library will be 50^3 x 4^3, or 8 x 10^6. With 10^7 beads in the library, the expected
number of beads representing each possible partitioning permutation will be (10^7 / 8 x 10^6 =) 1.25.
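
The arithmetic of this example can be retraced with a short script. This is only an illustrative sketch: the variable names are ours, and Avogadro's number is used to convert 100 pmoles into molecules (the text rounds the result to 6 x 10^13).

```python
from math import prod

AVOGADRO = 6.022e23  # molecules per mole

# (diversity factor, partitioning factor) for each cycle, as in Table 3:
# cycles 1-3 use four mixtures of 25 amino acids, cycles 4-6 use 50 mixtures of 2.
cycles = [(25, 4)] * 3 + [(2, 50)] * 3

beads_in_library = 10**7
pmol_per_bead = 100
detection_limit = 10**6  # molecules of one sequence per bead, as assumed in the text

molecules_per_bead = pmol_per_bead * 1e-12 * AVOGADRO    # about 6 x 10^13
bead_diversity = prod(df for df, _ in cycles)            # sequences per bead
library_pf = prod(pf for _, pf in cycles)                # partitioning permutations
library_diversity = bead_diversity * library_pf          # distinct hexapeptides
copies_per_sequence = molecules_per_bead / bead_diversity

bead_in_library_sf = beads_in_library / library_pf
peptide_on_bead_sf = copies_per_sequence / detection_limit

print(f"bead diversity:            {bead_diversity:,}")
print(f"library PF:                {library_pf:,}")
print(f"library diversity:         {library_diversity:.1e}")
print(f"copies per sequence/bead:  {copies_per_sequence:.2e}")
print(f"bead-in-library SF:        {bead_in_library_sf:.2f}")
print(f"peptide-on-bead SF:        {peptide_on_bead_sf:.0f}")
```

Changing the entries of `cycles` shows how shifting cycles between the two mixture sizes trades one safety factor against the other.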

Example 3A: Screening of the primary library for TNF binding. 



To illustrate the technique, the above library is screened for beads which specifically bind TNF as
follows: The beads are mixed with a solution containing TNF (Tumor Necrosis Factor) at a concentration of
10 µg/ml in 5% low fat milk buffer. Following washing with phosphate-buffered saline (PBS), the beads are
incubated with rabbit antibodies specific for TNF, washed as before and incubated with antibodies specific
to rabbit immunoglobulin which are conjugated with alkaline phosphatase (anti-rabbit Ig-alkaline
phosphatase). After washing, the beads are exposed to the substrate BCIP (5-bromo-4-chloro-3-indolyl
phosphate). Those beads which bound the ligand and/or the immunoglobulins are stained blue.

The stained beads are collected with a micropipettor under a microscope and destained by DMF
treatment. The bound proteins (TNF and antibodies) are removed by washes with 0.1 N HCl. The destained
beads are reacted with the anti-TNF and anti-rabbit Ig-alkaline phosphatase conjugate without preincubation
with TNF. Any beads which are stained by the antibodies are removed. The peptides on the beads
stained at this stage apparently bind to the antibodies or to the alkaline phosphatase and do not bind TNF.
(Of course, a different target could be substituted for TNF.)

Example 3B: Screening the selected TNF-binding beads of the primary library for binding of
TNF-TBP1 complexes.

The unstained beads from the previous stage are reacted with the TNF as before and then reacted with
TBP1 (TNF Binding Protein p55, or soluble TNF receptor type 1). The beads are then reacted with rabbit
antibodies specific to TBP1 and with the anti-rabbit Ig antibodies conjugated with alkaline phosphatase as
before. The beads are then stained by reaction with BCIP. The blue beads apparently bind TNF in an
orientation which allows the bound TNF to bind TBP1. The unstained beads apparently bind TNF in an
orientation which blocks the active site. The beads which were unstained when exposed to the TBP1 and its
antibodies are then reacted with antibodies to TNF as before in order to verify that they still bind the TNF.
The stained beads are then subjected to N-terminal sequencing. The peptides on these beads inhibit TNF
activity by inhibiting its binding to the soluble Type 1 receptor, the TBP1. Since the soluble receptor is the
extracellular portion of the cell surface receptor, the peptide, by binding also to the latter, should inhibit the
binding of TNF to the receptor and thereby ameliorate the harmful influence of TNF.
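
The decision logic of Examples 3A and 3B can be summarized as a small classifier over the staining outcomes of a single bead. This is a sketch only; the record fields and category labels are hypothetical names introduced for illustration, not terms from the procedure itself.

```python
from dataclasses import dataclass


@dataclass
class BeadStains:
    """Staining outcome of one bead in each round (True = stained)."""
    with_tnf: bool            # Example 3A: TNF, anti-TNF, conjugate
    antibodies_only: bool     # Example 3A control: antibodies without TNF
    with_tnf_and_tbp1: bool   # Example 3B: TNF, TBP1, anti-TBP1, conjugate
    tnf_recheck: bool         # Example 3B: restain with anti-TNF


def classify(bead: BeadStains) -> str:
    if not bead.with_tnf:
        return "not selected"
    if bead.antibodies_only:
        # Stains without the target, so it binds antibody or enzyme.
        return "binds antibodies or alkaline phosphatase"
    if bead.with_tnf_and_tbp1:
        # Bound TNF can still bind TBP1.
        return "binds TNF, receptor site left free"
    if bead.tnf_recheck:
        # Still binds TNF, but the bound TNF cannot bind TBP1.
        return "candidate inhibitor, send to sequencing"
    return "TNF binding not confirmed"


print(classify(BeadStains(True, False, False, True)))
```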

Example 3C: The secondary library

The sequencing of each bead selected from the primary library yields two amino acids for each of
positions 1-3. In the current example, each position contains 2 amino acids. Therefore, there are 8 different
possible tripeptides. For each three-amino-acid sequence obtained from analysis of the selected beads, we
synthesize 8 different secondary libraries, each expressing a single tripeptide of the possible 8. Positions 4-6
of each of the 8 secondary libraries contain two amino acids per position, as was done for positions 1-3
in the initial library. Each secondary library contains 10^6 beads in order to allow full expression of all
possible hexapeptides from the set of 100 different amino acids used to construct the library.

The 8 secondary libraries, based upon the results of the sequencing of the first 3 positions in a selected 
bead obtained from the primary library, are synthesized according to the following table: 

Table 3c

Cycle     Residue   Cycle DF   Cycle PF
6         1         1          1
5         2         1          1
4         3         1          1
3         4         2          50
2         5         2          50
1         6         2          50
overall             8          50^3



The partitioning factor for the library is 50^3 (125,000), for which 10^6 beads are adequate. Because the diversity
of each bead is only 8, residues 4-6 are sequenceable (residues 1-3 are identical throughout a given
secondary library). The diversity of the secondary library is 100^3.
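
As an illustration of how the eight secondary libraries follow from one sequenced bead, the candidate tripeptides are simply the Cartesian product of the two residues seen at each of positions 1-3. The residue names below are a hypothetical sequencing result chosen for the sketch, not data from this example.

```python
from itertools import product

# Hypothetical sequencing result: two amino acids detected at each of positions 1-3.
observed = [("Ala", "Trp"), ("Glu", "Ser"), ("Met", "Val")]

# One secondary library is synthesized per candidate tripeptide.
candidate_tripeptides = ["-".join(combo) for combo in product(*observed)]

for number, tripeptide in enumerate(candidate_tripeptides, start=1):
    print(f"secondary library {number}: N-{tripeptide}-X-X-X")
```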

Each of the secondary libraries thus synthesized is probed as described above for probing of the
primary library. The identity of the most highly stained library indicates the exact sequence of the tripeptide
in positions 1-3 from the N-terminal. The most darkly stained beads from the library are selected and
subjected to N-terminal sequencing. The sequence of the first 3 positions should be known from the
synthesis of this library (one of the 8 possible peptides), and the sequence information for positions 4-6
(two possible amino acids in each) is the basis for the tertiary library which is described below.

Example 3D: The tertiary library 

Since the diversity of each bead in the secondary library is only 8, the sequencing of the peptides on
an active bead will reveal that one or more of the 8 possible sequences is an active sequence. The tertiary
library described in this example has a dual purpose: identification of the exact sequence in residues 4-6;
and exploration of those nonapeptides, formed of the same 100 amino acids, which begin with the active
hexapeptide sequence.

There are eight tertiary libraries, each representing one of the eight possible sequences at residues 4-6
for the active bead from one of the secondary libraries of the last example. For a given tertiary library, the
synthetic plan is as follows:

Table 3d

Cycle     Residue   DF(bead)   PF(library)
9         1
8         2
7         3
6         4
5         5
4         6
3         7         2          50
2         8         2          50
1         9         2          50
overall             8          50^3

This will yield a bead diversity of 8 and a library diversity of 100^3. If there is an active bead, residues 4-6
will be known (by virtue of knowing which tertiary library the bead was in), and residues 7-9 will be limited
to one of eight possibilities (as shown by sequencing those residues of the bead).

It should be apparent that it would be possible to synthesize the next eight quaternary dodecapeptide
libraries to identify residues 7-9 while exploring possibilities at positions 10-12, and so forth through further
generations of libraries.

Example 4 

In this example, an octapeptide library is presented. For the purpose of this example, we assume that
there are 10^8 beads in the library, 6 x 10^13 peptides (100 pmoles) per bead, a detection limit of one active
bead with 10^5 active molecules thereon, and a sequence limit of 10 pmole/residue per bead. The library is
constructed as follows:

Table 4

Type   Cycle   Residue   DF   PF
SR     8       1         1    40
SR     7       2         1    40
SR     6       3         1    40
SR     5       4         1    40
UR     4       5         40   1
UR     3       6         40   1
UR     2       7         40   1
UR     1       8         40   1



This library therefore has the following statistics: 



                     Size         Diversity      Sampling Level   PF
Library statistics   6 x 10^21    6.55 x 10^12   9 x 10^8         2.56 x 10^6
Bead statistics      6 x 10^13    2.56 x 10^6    2.4 x 10^7

Safety Factor
detect bead-in-library    400
detect peptide-on-bead    24
sequence residues 1-4     10



In general, if the number of beads in the library (B_L) is within one or two orders of magnitude of the ratio
(M_B/M_B'), where M_B is the number of molecules per bead and M_B' is the molecule-per-bead detection limit,
the best strategy is to employ equal numbers of structured and unstructured random cycles, e.g., 4 and 4
for an octapeptide library.

However, this assumption may not always be valid. Suppose that there are 10^9 beads in the library, but
the detection limit is 10^8 molecules per bead.

A four-and-four strategy would lead to these statistics: 







             Size         Diversity      Sampling Level   PF
library      6 x 10^22    6.55 x 10^12   9 x 10^9         2.56 x 10^6
bead         6 x 10^13    2.56 x 10^6    2.47 x 10^7

Safety Factor
detect bead-in-library    400
detect peptide-on-bead    0.24
sequence residues 1-4     10



It would be safer to employ 5 structured and three unstructured random cycles: 





             Size         Diversity     Sampling Level   PF
library      6 x 10^22    10^12         9 x 10^9         2.56 x 10^9
bead         6 x 10^13    6.4 x 10^4    9 x 10^8

Safety Factor
detect bead-in-library    10
detect peptide-on-bead    9
sequence residues 1-4     10
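
The trade-off between the two safety factors can be explored numerically. The sketch below assumes, as in Table 4, that every cycle draws on 40 amino acids, and uses the figures of the second scenario (10^9 beads, 6 x 10^13 molecules per bead, detection limit 10^8); the function names and the "maximize the smaller safety factor" criterion are our own illustrative choices rather than a rule stated in the text.

```python
def safety_factors(n_structured, n_cycles=8, n_amino_acids=40,
                   beads=10**9, molecules_per_bead=6e13, detection_limit=10**8):
    """Return (bead-in-library, peptide-on-bead) safety factors."""
    library_pf = n_amino_acids ** n_structured                    # partitioning permutations
    bead_diversity = n_amino_acids ** (n_cycles - n_structured)   # sequences per bead
    bead_in_library = beads / library_pf
    peptide_on_bead = (molecules_per_bead / bead_diversity) / detection_limit
    return bead_in_library, peptide_on_bead


def best_split(n_cycles=8, **kwargs):
    """Number of structured cycles that maximizes the smaller safety factor."""
    return max(range(n_cycles + 1),
               key=lambda k: min(safety_factors(k, n_cycles=n_cycles, **kwargs)))


for k in (4, 5):
    in_library, on_bead = safety_factors(k)
    print(f"{k} structured cycles: bead-in-library {in_library:.1f}, "
          f"peptide-on-bead {on_bead:.2f}")
print("best split:", best_split(), "structured cycles")
```

With these inputs the four-and-four design leaves the peptide-on-bead factor below 1, while five structured cycles bring both factors to roughly 10, in line with the rounded figures tabulated above.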



EXPERIMENTAL EXAMPLE A 

1. Synthesis of a model peptide: N-Ala-Leu-Pro (PLA) on Eupergit C resin. Using conventional
solid phase synthesis with Fmoc-protected amino acids, we synthesized the PLA sequence on
epoxy-activated Eupergit C (Rohm, Darmstadt, Germany; alternatively, Spectra/Cryl, Spectrum, Houston,
Texas, USA) resin (bead diameter 250 µm), after introduction of cystamine as linker, and evaluated the
peptide yield per bead by amino acid microsequencing. By this we confirmed our synthesis method and
reagents and demonstrated that we can determine the amino terminal sequence of one bead (about
20-40 pmole amino acid per bead). The ability to sequence one bead is essential for the peptide library
approach. The average yield per bead of each amino acid from the sequences, based on sequencing
20-30 beads, was as follows:



Cycle   Amino Acid   Yield (pmole)
1       Ala          63.9
2       Leu          45.1
3       Pro          31.1



2. Synthesis of a peptide library on the Eupergit C beads. The library was prepared from 37
different amino acids (the genetically encoded amino acids, except for L-cysteine; their D-forms, except
for D-isoleucine; plus L-Norleucine) and constructed from six random (entire library receives a mixture of
all the amino acids) steps followed by three structured (each 1/37 of the library gets a single amino acid)
steps. This library was used first to test, by sequencing, the introduction of all the different amino acids,
when alone or in a mixture. We demonstrated by sequencing that all of the amino acids were
represented in the library. The following table sets forth the average yields, in pmoles per bead, obtained
by sequencing the beads.

Cycle Type   Ala   Arg   Asn   Asp   Cys   Glu   Gln   Gly   His   Ile
structured   133   151   80    244   N/A   138   106   188   77    195
random       582   125   113   338   N/A   214   234   253   48    29

Cycle Type   Leu   Lys   Met   Phe   Pro   Ser   Thr   Trp   Tyr   Val
structured   217   191   294   263   131   21    83    78    62    249
random       244   155   313   408   234   21    29    27    233   52

In the structured region, most of the amino acids were represented in equimolar amounts (some, such as
Serine, gave a very low peak owing to sequencing problems), while in the random region the distribution
was nonequimolar. There was a deviation from random representation of about an order of magnitude
from the best to the worst represented amino acids. The favored amino acids were Alanine, Aspartic
acid, Phenylalanine, Methionine and Leucine. The worst represented amino acids were Serine,
Threonine, Isoleucine, Tryptophan and Valine. For Isoleucine and Valine we observed that in the first
3 positions, where each amino acid was incorporated alone, they were highly represented, but were less
well represented in positions 4-6 where they were incorporated from a mixture. Thus we could
distinguish between difficulty in relative coupling efficiency and difficulty in sequencing. Since the
N-terminus of the peptides that contains the structured part is unique and is the important part in this type
of library, we were not concerned by the differences between the presentation rates of the amino acids
at the random part.
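
The reasoning used above for Isoleucine, Valine and Serine can be expressed as a comparison of the two rows of the yield table. The following is a sketch only: the yields are copied from the table, but the two cutoffs and the three labels are arbitrary illustrative assumptions, not criteria used in the experiment.

```python
# Average yields in pmoles per bead: amino acid -> (structured cycle, random cycle)
yields = {
    "Ala": (133, 582), "Arg": (151, 125), "Asn": (80, 113), "Asp": (244, 338),
    "Glu": (138, 214), "Gln": (106, 234), "Gly": (188, 253), "His": (77, 48),
    "Ile": (195, 29),  "Leu": (217, 244), "Lys": (191, 155), "Met": (294, 313),
    "Phe": (263, 408), "Pro": (131, 234), "Ser": (21, 21),   "Thr": (83, 29),
    "Trp": (78, 27),   "Tyr": (62, 233),  "Val": (249, 52),
}

LOW_IN_MIXTURE = 60   # hypothetical cutoff for "poorly represented" in a random cycle
HIGH_ALONE = 150      # hypothetical cutoff for "couples and sequences well alone"
LOW_ALONE = 50        # hypothetical cutoff for "weak even when coupled alone"


def diagnose(amino_acid: str) -> str:
    structured, random_cycle = yields[amino_acid]
    if random_cycle >= LOW_IN_MIXTURE:
        return "well represented"
    if structured >= HIGH_ALONE:
        return "coupling difficulty in mixtures"   # fine alone, loses out in a mixture
    if structured < LOW_ALONE:
        return "sequencing difficulty"             # weak signal even when alone
    return "unclear"


random_yields = [r for _, r in yields.values()]
print(f"best/worst ratio in random cycles: {max(random_yields) / min(random_yields):.1f}")
for amino_acid in ("Ile", "Val", "Ser"):
    print(amino_acid, "->", diagnose(amino_acid))
```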

3. Screening of the Eupergit C peptide library with Rhodamine labeled TBP1. Purified TBP1 (TNF
binding protein; soluble extracellular domain of TNF receptor type 1, p55) (recombinant human, CHO
produced, affinity purified TBP1 was obtained from InterPharm Laboratories, Ltd.) was labeled with
rhodamine and applied to the peptide library. The result was a background pink staining of the beads
which probably resulted from the high hydrophobicity of the Eupergit C matrix. We also observed
fragility of the beads. For these two reasons we decided not to use this resin further and to try a
"classical" solid phase synthesis resin, namely polystyrene cross linked with divinyl benzene (PS-DVB).

4. Synthesis of the PLA model peptide on aminomethylated polystyrene 1% DVB resin. The
tripeptide was synthesized on the DVB-polystyrene resin (bead diameter 150 µm) as in paragraph 1, with
similar results.

5. Surface staining and ELISA model experiments with TBP1-conjugated beads. Purified TBP1
was immobilized onto aminomethylated polystyrene beads (1.5 ml of packed resin was reacted with 1.5
ml of 10% glutaraldehyde for 30 minutes, and 1 mg of TBP1 was then added and conjugated for 1 hour) and the
beads were immunostained with an alkaline phosphatase labeled monoclonal antibody (see EP Appl.
412,486) or with polyclonal rabbit antibodies to TBP1 followed by alkaline phosphatase labeled
anti-rabbit IgG.

Two signal generation systems, both mediated by alkaline phosphatase, were tested: surface staining
of the beads with the insoluble product produced by the substrate system 5-bromo-4-chloro-3-indolyl
phosphate/nitroblue tetrazolium (BCIP/NBT), and the soluble color produced from the
para-nitrophenyl phosphate (pNPP) substrate.

In the first system, when staining with monoclonal antibodies, beads (about 50 µl packed) were
incubated with 5% FBS (fetal bovine serum) in PBS and then incubated with monoclonal antibody
against TBP1 conjugated with alkaline phosphatase at a concentration of 1-10 µg/ml (in FBS/PBS buffer)
for 30 minutes. After washes with PBS containing 0.05% Tween-20, 330 µl per tube of BCIP/NBT
substrate (Bio Rad kit catalogue number 170-6432, diluted 1:100/1:100 according to the manufacturer's
instruction in 0.1 M Tris pH 9.5) were added. When staining with polyclonal antisera, beads (about 50 µl
packed) were incubated with 5% FBS (fetal bovine serum) in PBS and then incubated with rabbit
polyclonal serum against TBP1 diluted 1:2000 (in FBS/PBS buffer) for 30 minutes. After washes with
PBS containing 0.05% Tween 20, alkaline phosphatase conjugated to goat anti-rabbit IgG (Bio Makor
catalogue number 3471) diluted 1:1000 (in FBS/PBS buffer) was added for a 30 minute incubation,
followed by washes and addition of substrate as above.

In the second system, the procedure was as above for polyclonal antisera, but the substrate added
was para-nitrophenyl phosphate (pNPP, Sigma 104™, tablet of 5 mg in 10 ml of 10 mM diethanolamine,
pH 9.5, containing 0.5 mM MgCl2), instead of BCIP/NBT, and the soluble color was produced at 37 °C.
The resulting O.D. of each sample was monitored at 405 nm in microtiter plate wells.

The results showed high specific positive surface staining (with BCIP/NBT) of the TBP1-coupled
beads as compared to staining of control beads or staining of TBP1-labeled beads with control antibodies.
The background surface staining was essentially nonexistent. We could easily identify and pick up one
stained bead from among thousands of unstained beads under a transmission microscope or a
reflectance stereo microscope (dissecting microscope). While the soluble substrate detection system
was not sensitive enough to allow detection of a single stained bead, it could detect as few as ten
positively stained beads in one microtiter well, as compared to the control.

Other signal generating systems that were tested included the use of chemiluminescence substrate,
staining of the beads with colloidal gold-labeled second antibodies or use of 125I-labeled probes.
However, none of these systems were sensitive enough to allow detection of signal from single beads.
While such a system is not required for the present invention, it does permit one to increase the diversity
of the library.

6. Synthesis of a peptide library on aminomethylated polystyrene beads. The library was
synthesized using the 37 amino acids and constructed of six "semi random" steps and three structured
steps to yield: NH2-1-1-1-2-3-4-5-6-7-COO-NH-CH2-bead (each number represents the number of
different amino acids added to each portion of the beads at a given coupling step). This strategy was used
in order to increase the number of presented peptides without increasing the library size (volume or
number of beads). Each bead is therefore unique in the 3 positions starting at the amino terminus end,
contains two different amino acids at position 4, three at position 5 and so on. We believe that there is a
need for at least a pentapeptide in order to achieve a binding affinity sufficient for immunodetection of the
beads.

The amino acids actually used in the semi-random steps were as given below, with "position" being
measured from the amino terminal. In the subheadings, such as "9-3", the first number is the residue
position and the second number is the group number. Each group is a mixture of amino acids with which
an aliquot of beads is reacted. The number of groups is equal to the partitioning factor. Group 9-3 is one
of five groups, and each of the groups for position 9 is composed of 7 or 8 amino acids.
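
The number of groups at each position follows from dividing the 37 amino acids into mixtures of the nominal size. The sketch below assumes whole-number division, with the remaining amino acids absorbed by enlarging one or two groups (as with the groups of 7 or 8 at position 9); the variable names are ours.

```python
from math import prod

N_AMINO_ACIDS = 37

# Nominal number of amino acids per mixture at positions 1..9 from the N-terminus,
# i.e. the structure NH2-1-1-1-2-3-4-5-6-7-...-bead.
mixture_sizes = [1, 1, 1, 2, 3, 4, 5, 6, 7]

# Number of groups (the partitioning factor) at each position.
groups_per_position = [N_AMINO_ACIDS // size for size in mixture_sizes]

library_pf = prod(groups_per_position)            # partitioning permutations
nominal_bead_diversity = prod(mixture_sizes)      # ignores the enlarged groups

for position, (size, groups) in enumerate(zip(mixture_sizes, groups_per_position), 1):
    print(f"position {position}: {groups} groups of about {size}")
print(f"library partitioning factor: {library_pf:.2e}")
print(f"nominal sequences per bead:  {nominal_bead_diversity}")
```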



Position 9:

Group   Amino acids
9-1     L-Ala, L-Arg, L-Asn, L-Asp, L-Gln, L-Glu, Gly
9-2     L-His, L-Ile, L-Leu, L-Lys, L-Met, L-Nle, L-Phe
9-3     L-Pro, L-Ser, L-Thr, L-Trp, L-Tyr, L-Val, D-Ala
9-4     D-Arg, D-Asn, D-Asp, D-Gln, D-Glu, D-His, D-Leu, D-Lys
9-5     D-Met, D-Phe, D-Pro, D-Ser, D-Thr, D-Trp, D-Tyr, D-Val



Position 8:

Group   Amino acids
8-1     L-Ala, L-Arg, L-Asn, L-Asp, L-Gln, L-Glu
8-2     Gly, L-His, L-Ile, L-Leu, L-Lys, L-Met
8-3     L-Nle, L-Phe, L-Pro, L-Ser, L-Thr, L-Trp
8-4     L-Tyr, L-Val, D-Ala, D-Arg, D-Asn, D-Asp
8-5     D-Gln, D-Glu, D-His, D-Leu, D-Lys, D-Met
8-6     D-Phe, D-Pro, D-Ser, D-Thr, D-Trp, D-Tyr, D-Val



Position 7:

Group   Amino acids
7-1     L-Ala, L-Arg, L-Asn, L-Asp, L-Gln
7-2     L-Glu, Gly, L-His, L-Ile, L-Leu
7-3     L-Lys, L-Met, L-Nle, L-Phe, L-Pro
7-4     L-Ser, L-Thr, L-Trp, L-Tyr, L-Val
7-5     D-Ala, D-Arg, D-Asn, D-Asp, D-Gln
7-6     D-Glu, D-His, D-Leu, D-Lys, D-Met, D-Phe
7-7     D-Pro, D-Ser, D-Thr, D-Trp, D-Tyr, D-Val

Position 6:

Group   Amino acids
6-1     L-Ala, L-Arg, L-Asn, L-Asp
6-2     L-Gln, L-Glu, Gly, L-His
6-3     L-Ile, L-Leu, L-Lys, L-Met
6-4     L-Nle, L-Phe, L-Pro, L-Ser
6-5     L-Thr, L-Trp, L-Tyr, L-Val
6-6     D-Ala, D-Arg, D-Asn, D-Asp
6-7     D-Gln, D-Glu, D-His, D-Leu
6-8     D-Lys, D-Met, D-Phe, D-Pro
6-9     D-Ser, D-Thr, D-Trp, D-Tyr, D-Val

Position 5:

Group   Amino acids
5-1     L-Ala, L-Arg, L-Asn
5-2     L-Asp, L-Gln, L-Glu
5-3     Gly, L-His, L-Ile
5-4     L-Leu, L-Lys, L-Met
5-5     L-Nle, L-Phe, L-Pro
5-6     L-Ser, L-Thr, L-Trp
5-7     L-Tyr, L-Val, D-Ala
5-8     D-Arg, D-Asn, D-Asp
5-9     D-Gln, D-Glu, D-His
5-10    D-Leu, D-Lys, D-Met
5-11    D-Phe, D-Pro, D-Ser
5-12    D-Thr, D-Trp, D-Tyr, D-Val

Position 4:

Group   Amino acids
4-1     L-Ala, L-Arg
4-2     L-Asn, L-Asp
4-3     L-Gln, L-Glu
4-4     Gly, L-His
4-5     L-Ile, L-Leu
4-6     L-Lys, L-Met
4-7     L-Nle, L-Phe
4-8     L-Pro, L-Ser
4-9     L-Thr, L-Trp
4-10    L-Tyr, L-Val
4-11    D-Ala, D-Arg
4-12    D-Asn, D-Asp
4-13    D-Gln, D-Glu
4-14    D-His, D-Leu
4-15    D-Lys, D-Met
4-16    D-Phe, D-Pro
4-17    D-Ser, D-Thr
4-18    D-Trp, D-Tyr, D-Val



7. First screening of the polystyrene peptide library with TBP1.

One quarter (400 µl packed) of the total number of beads was blocked with 5% FBS in PBS (1 ml for
45 minutes), to avoid nonspecific binding, and incubated with TBP1 (1 ml at a concentration of 10 µg/ml
in FBS/PBS for 45 minutes) followed by polyclonal rabbit anti-TBP1 (1:500 in FBS/PBS, 1 ml for 1 hour)
and alkaline phosphatase conjugated goat anti-rabbit Ig antibodies (1:2000 in FBS/PBS, 1 ml for 40
minutes). The signal was generated by BCIP/NBT substrate (12 ml in a 9 cm petri dish; further details as
in 5 above). More than 50 stained beads were selected and three of them were subjected to sequencing.
Two additional beads, selected from a screening of a second quarter of the library with 100
ng/ml TBP1 (signal generated with FAST RED (4-chloro-2-methylbenzyl diazonium salt supplied by
Sigma, St. Louis, Missouri, USA, used in combination with naphthol AS-MX, 3-hydroxy-2-naphthoic acid
2,4-dimethylanilide) substrate), were also sequenced. The sequences obtained from the five beads are:

1. N-Ala-Gly-Met-(Pro, Phe, Ser?)-(Phe, Leu, Met)
2. N-Ala-Glu-Met-Ser?-Ser?
3. N-Val-Gln-Pro
4. N-Trp-Glu-Pro-Glu
5. N-Ser-Lys-Val-Leu-(Phe, Pro)

8. Testing the specificity of the selected beads. The beads that were selected were stained because
of their ability to bind either the TBP1, the antibodies to TBP1, the goat antibodies against rabbit IgG or
the alkaline phosphatase. It was therefore necessary to distinguish the beads that bind antibodies and
alkaline phosphatase from those that bind the TBP1. One way to verify the bead specificity is to destain,
and then restain the beads with all the components but without the target ligand, TBP1. The beads that
are stained by the antibodies alone are then removed as non-specific binders.

9. Destaining and restaining. The BCIP/NBT substrate that we used in order to stain the beads could
not be destained by any treatment that we tried (organic solvents, urea, SDS, NaOH, boiling, sonication
and combinations of all of the above), and we therefore decided to use another alkaline phosphatase
substrate which could be readily destained. The use of the substrate FAST RED resulted in satisfactory
staining, destaining and restaining of model beads (adsorbed with alkaline phosphatase labeled IgG). For
reasons unknown, stained library beads could not be restained. Replacing the NBT by MTT gave the
desired results, and beads that were stained could be destained with a short DMF wash.

10. Selection methods. When screening a small volume (<5 ml packed) of library beads, where the
number of stained beads is relatively small, it is possible to pick up the selected stained beads with
forceps or a micropipettor. When a large number of stained beads must be picked up, manual handling
might take too much time. We therefore tested several methods for faster and more convenient sorting.
We succeeded in separating positive model beads (TBP1 immobilized covalently) from negative beads
by reacting them with smaller magnetic beads coupled to anti-rabbit antibodies. The anti-rabbit
antibodies on the magnetic beads bind the TBP1-carrying beads which were previously reacted with rabbit
anti-TBP1 antibodies. The magnetically labeled beads were separated with a magnet. The separation
conditions were as follows:

TBP1-beads: as in 5.
Control beads: aminomethylated polystyrene beads without modification.
Bead mixture: 1% TBP1-beads in control beads (total 50 µl beads).
Blocking: 1 ml of 5% low fat milk in PBS + 0.05% Tween 20.
First antibody: purified rabbit polyclonal serum against TBP1, 6 µg/ml in FBS/PBS, 1 ml for 30 minutes.
Washes: PBS containing 0.05% Tween 20.
Magnetic beads: sheep anti-rabbit IgG coupled to magnetic beads (Dynabeads™ M-280, Dynal catalogue
number 112.03, 2.8 µm diameter), diluted 1:100 in blocking buffer, 1 ml for 30 minutes.
Separation: magnetic separation + 2 washes using the "Magnetic particle concentrator" MPC®-1 (Dynal
catalogue number 120.01).

Using such a system we demonstrated our ability to fish out all of the TBP1-coated beads from an
excess of control beads. 99.9% of the beads in the bound fraction were TBP1-beads and only 0.1%
were control beads.

11. Confirmation of the sequence. The sequencer cannot distinguish between L- and D-amino acids.
Thus, the sequence information is not complete. Information about the first three amino acids allows for
up to eight (2^3) possible sequences and we need to find out which is the correct one. From the first 2 beads
which were sequenced we obtained 2 partially overlapping sequences: N-Ala-Gly-Met and N-Ala-Glu-Met.
The 12 possible peptides obtained from the sequences of the first two selected beads (glycine has no
D-form, so the first sequence allows four stereoisomers and the second eight) were synthesized
on beads containing random collections of all the amino acids at positions 1-6 from the C-terminal. The
12 peptides (each synthesized on around 100,000 beads) were tested for TBP1 binding by the regular
procedure but with the soluble substrate. The peptide N-DAla-DGlu-LMet gave a high OD (ten times
higher than background) and the other 11 peptides were lower (between background and three times the
background). The specificity of the peptide to TBP1 was also confirmed by lack of response with
antibodies alone. The resulting OD in the presence of the TBP1 was four times higher than with the
antibodies only.
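
The count of 12 candidate peptides can be reproduced by enumerating the L/D assignments for each sequenced tripeptide. This sketch assumes that the second residue of the first sequence is glycine, which is achiral and therefore contributes a single form; the function name and the residue notation are ours.

```python
from itertools import product

ACHIRAL = {"Gly"}  # glycine has no D-form


def stereo_variants(sequence):
    """All L/D assignments consistent with a sequence read by the sequencer."""
    options = [
        [residue] if residue in ACHIRAL else [f"L-{residue}", f"D-{residue}"]
        for residue in sequence
    ]
    return ["N-" + "-".join(combo) for combo in product(*options)]


sequenced = [("Ala", "Gly", "Met"), ("Ala", "Glu", "Met")]
candidates = [variant for seq in sequenced for variant in stereo_variants(seq)]

print(len(candidates), "candidate peptides")
for peptide in candidates:
    print(peptide)
```

The peptide reported above as giving the highest OD corresponds to `N-D-Ala-D-Glu-L-Met` in this notation.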

12. Synthesis of secondary libraries. The sequence found from the first analyzed bead was used for
synthesis of 37 peptides on beads (150 µl packed; about 200,000 beads) primed with 5 random steps. At
the fourth place from the amino terminus the beads were divided into 37 groups and each group got a
different amino acid. The three terminal amino acids were N-DAla-DGlu-LMet. In this way we got 37
peptides which differ at the fourth position. Staining of the 37 peptides (separately, using soluble
substrate) indicated that L-Serine is the desired amino acid at the fourth position. The process was
repeated in order to find the amino acid at position five from the N terminal, but at this point we encountered
some problems, described in the next paragraph, so we are not sure about the results obtained by the
soluble substrate approach.

13. Screening with soluble substrate (ELISA). ELISA assays were performed in mini-columns made
from polypropylene syringes supported with a polyethylene frit. Equal amounts of beads of each tested
group were inserted into each column. Reagents and general procedure were as described for the model
beads in paragraph 5 for the second system; 200-500 µl of reagent were added to each column at each step.
After adding substrate, the columns were placed at 37 °C for about 60 minutes and then 200 µl of substrate
from each column (without the beads) were transferred to a microtiter plate well for O.D. reading at 405 nm.

The use of a soluble staining product for monitoring differences between peptides seems very useful 
especially since the use of ELISA reader enables discrimination of small differences between staining 
intensities. However, when we further used it for secondary libraries, results indicated that it is not 
possible to obtain a positive binding response based upon contribution of only 3 specific amino acids (as 
compared to contribution of 5-6 amino acids in the original library). Further experiments indicated 
existence of artifacts resulting from high background observed when using the soluble substrate. The 
background is apparently attributable to the container, and not the beads. The problem, we believe, 
results from absorption of proteins to the polyethylene frits of the columns used as reaction vessels. It 
may be possible to block this absorption and thereby improve results. 

14. Secondary library and sequence confirmation for the sequence: N-Ser-Lys-Val. Eight secondary
libraries based on the sequence obtained from screening of the library with TBP1 at 100 ng/ml (see
7) were synthesized with the general structure N-Ser-Lys-Val-1-1-1-R-R-R-Bead, where R represents a random
step and 1 is one of 37 amino acids added to each bead in each step. At the three N-terminal positions, each of
the eight libraries got one of the possible peptides (combinations of the L and D isomers). Each
secondary library contained about 50,000 beads, intended to include all the possible combinations of the
structured three amino acids at positions 4-6 from the N-terminal. The eight libraries were screened with
surface staining. The results indicated the N-terminal sequence N-DSer-DLys-LVal (group no. 4). One
of the strongly stained beads of group 4 was analyzed and the sequence obtained was
N-Ser-Lys-Val-Lys-Lys.

15. Specificity of the N-DSer-DLys-LVal peptide. Several staining steps were performed with the 
library beads containing the N-DSer-DLys-LVal sequence in order to verify its specificity. Immunostaining 
performed with and without TBP1 revealed specificity to the antibodies. Further analysis revealed that 
the above sequence specifically recognized the rabbit immunoglobulins from both rabbit anti-TBP1
antiserum and from normal rabbit serum. Immunoglobulins from other animal species and from humans 
were not recognized. 

16. Sensitivity of the surface staining. We experienced some difficulties in obtaining reproducible
high surface staining results. No false positive results were ever experienced. We assume that the
difficulty resulted from the nature of the substrate used. We attempted to increase the concentration of
reagents up to a level which we believed would ensure reproducible results, with low background.
Control beads (without peptides) were tested with increasing TBP1 and antibody concentrations and
with different blockers. We have shown that background staining was negligible even with high
concentrations of antibodies (e.g., above 10 mg/ml of monoclonals or purified polyclonals, and dilutions of
below 1:100 of crude polyclonals) when using 5% low fat milk powder instead of 5% FBS, or when the
TBP1 concentration was 100 µg/ml (when using diluted antibodies, even with FBS blocker). We prefer to
perform screening using a high ligand concentration (10-100 µg/ml) and to use diluted antibodies in order
to select as few immunoglobulin-binding and alkaline phosphatase-binding peptides as possible.
17. Application of the selection procedures in screening for TBP1 binding peptides. The
currently available peptide library (described in par. 6) was stained for TBP1 and antibodies, and stained
beads were retrieved. The beads were destained and reacted with the antibodies without TBP1. Beads
stained with the antibodies were removed. The beads were then reacted with TBP1, followed by reaction
with TNF and staining with antibodies to TNF. Those beads that bound TNF were left aside as
TBP1-TNF complex recognizers. Such peptides could probably be used for assaying TBP1-TNF complex
formation. The beads were then restained with TBP1 and anti-TBP1 antibodies and the most highly stained
beads were selected. This staining procedure was repeated a second time in order to ensure specificity.
A total of about 15 beads were recovered. Apparently, these beads bind TBP1 at a site that is close
enough to the TNF binding site to inhibit the binding of TNF, or else the binding of the peptide inhibits
TNF binding by an allosteric mechanism. These peptides are candidates for TBP replacement therapy.
In diagnostics they can be used in immunoassays for measuring only the free TBP1 (information that
can be very useful), and in production for affinity purification of TBP1 from production fluids.

EXPERIMENTAL EXAMPLE B 

Model Staining of Peptide Library with Monoclonal Antibody to Human Endorphin

This peptide library is a heptapeptide library in which the C-terminal amino acid is constant. The other 
six positions were varied. 

Materials

Beads:

a. Beads of high density peptide library #4, which was constructed according to the following parameters:
Resin type: polystyrene/divinylbenzene.
Amino acids used = 73
Total hexapeptide diversity = 1.5 x 10^11

Structure: N-2.28-2.28-2.28-5-24-73-L-L-L-Bead (L = linker). (Numbers represent the statistically expected
average number of different amino acids incorporated on each bead in the indicated position. Since the
groups incorporated in a given cycle may be of different sizes, the average number is not always a
whole number. For example, 2.28 is the weighted average of 23 groups with 2 a.a./group and 9 with 3
a.a./group.)

All beads were mixed together in one pool.
Total peptides per bead = about 70,000
Only about 0.2 repeat of hexapeptide per sequence was used in this experiment; other parts were used
in previous experiments.
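
The weighted average and the diversity figure quoted above can be checked with a few lines of arithmetic. The following is only a sketch: it uses the group counts stated in the text (23 groups of 2 amino acids and 9 groups of 3), and the variable names are illustrative rather than part of the described method.

```python
# Group sizes for positions 1-3 of library 4, as stated in the text:
# 23 groups of 2 amino acids and 9 groups of 3 amino acids.
group_sizes = [2] * 23 + [3] * 9

# Every amino acid belongs to exactly one group, so the sizes add up
# to the number of amino acids used.
amino_acids_used = sum(group_sizes)

# Average number of different amino acids a bead carries at such a position.
average_per_bead = amino_acids_used / len(group_sizes)

# Six varied positions, each open to all amino acids used.
total_hexapeptides = amino_acids_used ** 6

print(f"amino acids used:      {amino_acids_used}")
print(f"average per position:  {average_per_bead:.2f}")
print(f"hexapeptide diversity: {total_hexapeptides:.1e}")
```

The same reasoning explains the whole numbers in the structure line: a position built from a single group of all amino acids contributes 73.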

In library 4, the groups were as follows (with amino acids identified by ID number):
Position 6 (one group)
Group 1: 1-3, 5, 7-12, 14-23, 28, 29, 31-39, 42-48, 51-66, 68, 70-82, 85-89.
Position 5 (3 groups)
Group 1: 3, 7, 8, 9, 14, 15, 18-23, 28, 29, 31-33, 36, 38, 39, 54, 73-76.
Group 2: 1, 2, 5, 11, 16, 17, 52, 53, 55, 59, 60-62, 65, 71, 72, 77-80, 85-89.
Group 3: 10, 12, 34, 35, 37, 42-48, 51, 56-58, 63, 64, 66, 68, 70, 81, 82.
Position 4 (15 groups)
Group 1: 9, 18, 19, 20, 21
Group 2: 28, 29, 58, 73, 74
Group 3: 3, 23, 33, 75, 76
Group 4: 31, 32, 38, 39, 54
Group 5: 7, 8, 14, 15, 36
Group 6: 16, 17, 37, 70, 89
Group 7: 2, 5, 11, 79, 80
Group 8: 1, 71, 72, 65
Group 9: 52, 53, 55, 61, 62
Group 10: 43, 85-88
Group 11: 22, 59, 60, 77, 78
Group 12: 63, 64, 66, 68
Group 13: 34, 35, 42, 48, 51
Group 14: 44-47, 82
Group 15: 10, 12, 56, 57, 81




Positions 1-3 (32 groups)
Group 1: 11, 20, 21
Group 2: 18, 19
Group 3: 73, 74
Group 4: 28, 29
Group 5: 75, 76
Group 6: 31, 32
Group 7: 38, 39
Group 8: 7, 8
Group 9: 14, 15
Group 10: 16, 17
Group 11: 79, 80
Group 12: 71, 72
Group 13: 52, 53
Group 14: 85, 86
Group 15: 87, 88, 37
Group 16: 59, 60
Group 17: 77, 78
Group 18: 61, 62
Group 19: 63, 64
Group 20: 48, 51, 43
Group 21: 44, 45
Group 22: 56, 57
Group 23: 58, 42
Group 24: 3, 33
Group 25: 68, 66, 1
Group 26: 22, 10, 12
Group 27: 2, 65, 81
Group 28: 46, 47, 23
Group 29: 55, 82
Group 30: 70, 5
Group 31: 89, 34, 35
Group 32: 36, 54


b. Beads of high density peptide library #6, which was constructed according to the parameters of library
#4 with the following changes:

Structure: N-(1-2)-(2-3)-4-5-8-10-Asn-AcaE-Bead

Total peptides per bead = about 7,500

The beads of this library were not pooled after the last synthesis cycle (position 1 from the N-terminal)
but were left grouped according to their position 1 for the final deprotection and screening steps. Groups
containing the N-terminal Tyr (group 11) and Pro (group 12) were checked in this experiment.

The library may be further characterized as follows:
Position 6 (7 groups)
Group 1: 5, 18-22, 28, 29, 31, 32
Group 2: 1-3, 33, 38, 39, 73-76
Group 3: 9-12, 77-82
Group 4: 43, 56-60, 85-88
Group 5: 23, 44-48, 51, 61-64
Group 6: 34, 35, 52-55, 69, 66, 68, 71, 72
Group 7: 7, 8, 14-17, 36, 37, 42, 70, 89

Position 5 (9 groups)
Group 1: 18, 19, 28, 29, 73-76
Group 2: 7, 8, 14, 15, 20, 21, 31, 32
Group 3: 63, 64, 66, 68, 71, 72, 77, 78
Group 4: 44-47, 85-88
Group 5: 16, 17, 38, 39, 52-55
Group 6: 10, 12, 34, 35, 56, 57, 59, 60
Group 7: 48, 51, 61, 62, 79-82
Group 8: 22, 37, 42, 43, 58, 65, 70, 89
Group 9: 1-3, 5, 9, 11, 23, 33, 36

Position 4 (15 groups)
Group 1: 20, 21, 18, 19, 9
Group 2: 73, 74, 28, 29, 58
Group 3: 75, 76, 23, 3, 33
Group 4: 31, 32, 38, 39, 54
Group 5: 7, 8, 36, 14, 15
Group 6: 89, 16, 17, 70, 37
Group 7: 11, 79, 80, 5, 2
Group 8: 52, 53, 55, 61, 62
Group 9: 43, 85-88
Group 10: 22, 59, 60, 77, 78
Group 11: 34, 35, 42, 48, 51
Group 12: 44-47, 82
Group 13: 10, 12, 56, 57, 81
Group 14: 1, 65, 71, 72
Group 15: 63, 64, 66, 68



Position 3 (18 groups)
Group 1: 18, 19, 28, 29
Group 2: 73-76
Group 3: 20, 21, 31, 32
Group 4: 7, 8, 14, 15
Group 5: 71, 72, 77, 78
Group 6: 63, 64, 66, 68
Group 7: 44-47
Group 8: 85-88
Group 9: 52-55
Group 10: 16, 17, 38, 39
Group 11: 56, 57, 59, 60
Group 12: 10, 12, 34, 35
Group 13: 48, 51, 61, 62
Group 14: 79-82
Group 15: 22, 42, 43, 56, 57
Group 16: 37, 65, 70, 89
Group 17: 1, 5, 9, 11
Group 18: 2, 3, 23, 33, 36



Position 2 (24 groups)
Group 1: 11, 20, 21
Group 2: 18, 19, 54
Group 3: 28, 29, 36
Group 4: 5, 75, 76
Group 5: 31, 32, 70
Group 6: 38, 39, 42
Group 7: 7, 8
Group 8: 14, 15, 82
Group 9: 9, 77, 78
Group 10: 23, 46, 47
Group 11: 63, 64, 81
Group 12: 48, 51, 43
Group 13: 44, 45, 65
Group 14: 56, 57, 89
Group 15: 68, 66, 22
Group 16: 79, 80, 33
Group 17: 52, 53, 1
Group 18: 16, 17, 55
Group 19: 71, 72, 3
Group 20: 85, 86, 2
Group 21: 87, 88, 37
Group 22: 59, 60
Group 23: 10, 12, 34, 35
Group 24: 73, 74, 61, 62


Position 1 (40 groups)
Group 1: 20, 21
Group 2: 18, 19
Group 3: 73, 74
Group 4: 28, 29
Group 5: 75, 76
Group 6: 31, 32
Group 7: 38, 39
Group 8: 7, 8
Group 9: 14, 15
Group 10: 16, 17
Group 11: 79, 80
Group 12: 71, 72
Group 13: 52, 53
Group 14: 85, 86
Group 15: 87, 88
Group 16: 59, 60
Group 17: 77, 78
Group 18: 61, 62
Group 19: 63, 64
Group 20: 48, 51
Group 21: 44, 45
Group 22: 56, 57
Group 23: 58, 42
Group 24: 3, 33
Group 25: 68, 66
Group 26: 10, 12
Group 27: 34, 35
Group 28: 46, 47
Group 29: 55, 82
Group 30: 70, 5
Group 31: 11, 23
Group 32: 36, 54
Group 33: 37, 81
Group 34: 1
Group 35: 9
Group 36: 89
Group 37: 65
Group 38: 2
Group 39: 43
Group 40: 22




c. Model beads 


(TentaGel bead 



Leu and n-His-Pro-Tyr-Pro-Pro. 

Monoclonal antibody to human β-endorphin (Boehringer Mannheim, Germany, Cat. No. 1089 170);
reacts with the N-terminal Tyr-Gly-Gly-Phe sequence of human β-endorphin.

Polyclonal rabbit antibodies against mouse IgG conjugated with alkaline phosphatase (BioMakor, Israel,
Cat. No. 3465).

Blocking buffer: 1% I-Block (Tropix, MA, USA) in PBS containing 0.05% NaN3, 0.1% Tween 20, 0.133
g/L CaCl2.2H2O and 0.1 g/L MgCl2.6H2O.

Wash buffer: PBS containing 0.05% NaN3 and 0.1% Tween 20.

BCIP, 5-bromo-4-chloro-3-indolyl phosphate (Sigma, Cat. No. B-0274): 25 mg dissolved in 0.5 ml
dimethylformamide.

Method:

The peptide libraries were synthesized essentially as previously described.

Staining of the beads was performed using the following steps (all incubation steps were performed with
continuous mixing of the beads):
a. Washes of the beads with wash buffer.
b. Incubation of the beads with blocking buffer for 45 minutes to block non-specific interactions.
c. Incubation of the beads with the monoclonal antibody to human β-endorphin, diluted to 200 ng/ml in
blocking buffer, for 40 minutes.
d. Six washes with wash buffer.
e. Incubation of the beads with the polyclonal rabbit antibodies against mouse IgG conjugated with
alkaline phosphatase, diluted 1:1000 in blocking buffer, for 40 minutes.
f. Six washes with wash buffer and one wash with Tris (25 mM), NaCl (125 mM), Tween (0.1%).
g. Incubation of the beads with BCIP diluted to 500 µg/ml in 0.1 M Tris pH 9.0, for 2 hours.
h. Two washes with H2O, transfer of the beads to petri dishes and observation for blue stained beads
(using a stereomicroscope).
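
As a worked example of step g, the dilution of the BCIP stock (25 mg in 0.5 ml dimethylformamide) down to the working concentration of 500 µg/ml can be sketched as follows. The helper names and the 1 ml final volume are illustrative assumptions and do not come from the procedure itself.

```python
def dilution_factor(stock_conc, working_conc):
    """Fold dilution needed to go from stock to working concentration (same units)."""
    return stock_conc / working_conc


def stock_volume_ul(final_volume_ml, fold):
    """Microlitres of stock needed to prepare final_volume_ml at the given fold dilution."""
    return final_volume_ml * 1000 / fold


# BCIP stock from the Materials list: 25 mg dissolved in 0.5 ml DMF.
bcip_stock_ug_per_ml = 25 * 1000 / 0.5

# Working concentration in step g: 500 ug/ml in 0.1 M Tris pH 9.0.
bcip_fold = dilution_factor(bcip_stock_ug_per_ml, 500)

# Hypothetical final volume of 1 ml of staining solution.
bcip_ul_per_ml = stock_volume_ul(1.0, bcip_fold)

print(f"stock concentration: {bcip_stock_ug_per_ml:.0f} ug/ml")
print(f"dilution: 1:{bcip_fold:.0f}")
print(f"stock per ml of staining solution: {bcip_ul_per_ml:.0f} ul")
```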

Results:

The staining results are given in Table B-1:

Table B-1

Staining of beads with anti-endorphin McAb clone 3-E7

Bead Source                     Number of Blue-stained Beads
Library 6: Tyr at N-terminal    about 100 beads
Library 6: Pro at N-terminal    about 50 beads



Some of the stained beads were submitted for N-terminal sequencing using a gas phase peptide
microsequencer (model 475A, Applied Biosystems) and the results are summarized in Table B-2:

Table B-2

Sequence Results of Blue-stained Beads

No.  Bead Source                 Position 1          Position 2          Position 3                     Sequence
1    Lib. 6: Tyr at N-terminal   Tyr (Y)             Gly (G)             Gly (G)                        nYGG
2    Lib. 6: Tyr at N-terminal   Tyr (Y)             Met (M)                                            nYM
3    Lib. 6: Pro at N-terminal                                                                          no sequence yield
4    Lib. 4                      Lys (K) + Tyr (Y)   Phe (F) + Thr (T)   Thr (T) + Leu (L) + Gly (G)    n-K/Y-F/T-T/L/G
5    Lib. 4                      Gln (Q)             Gly (G)             Tyr (Y)                        nQGY
6    Lib. 4                      Tyr (Y)             Gly (G)                                            nYG

Discussion

As can be seen, the sequence nYGG, which is part of the peptide sequence to which the antibody was
raised, was identified by library 6 (70,000 hexapeptides per bead). These results prove our basic hypothesis
about the ability to detect specific sequences synthesized on beads carrying many hexapeptides per bead.

The other sequences obtained, nYM and nQGY, were not reported in the literature but could be correct,
because those sequences may contain some non-conventional amino acids (that were not used
in previous reports) in positions closer to the C-terminal, and D isomers of the YGGF amino acids at any
position, which can influence the binding to the antibody epitope. Furthermore, the amino acids Gln (Q) and
Met (M) recognized by us were also common in some of the literature reports, but in other positions of the
tetrapeptide sequence. The sequence nQGY seems to contain the reverse of nYG and might be recognized
by the antibody.

Working Example C

This section describes the synthesis and screening of a library prepared from glass beads.

The glass beads were obtained from Potters-Ballotini in France. Type 5000 beads were used. The
claimed diameter of the beads is 0-30 microns. The beads were separated by sedimentation at 1 g and a
subfraction was obtained. The average diameter of the separated beads is about 7 microns.

The capacity of the beads is about 1 µmole/ml, which corresponds to about 300,000 peptides per bead.
Thus, assuming there are 1000 different peptides per bead, each of them will be represented by 300
peptides.
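
The trade-off described above, where more distinct peptides on a bead leave fewer copies of each, is a simple division. The sketch below uses the 300,000 peptides per bead figure from the text; the function name and the alternative diversity values (100 and 10,000) are illustrative assumptions only.

```python
def copies_per_sequence(peptides_per_bead, distinct_sequences):
    """Average number of copies of each distinct peptide on one bead."""
    return peptides_per_bead / distinct_sequences


PEPTIDES_PER_BEAD = 300_000  # figure quoted in the text for the ~7 micron glass beads

# 1,000 distinct peptides per bead is the case discussed in the text;
# the other two values only illustrate how the copy number scales.
for distinct in (100, 1_000, 10_000):
    copies = copies_per_sequence(PEPTIDES_PER_BEAD, distinct)
    print(f"{distinct:>6} distinct peptides per bead: {copies:g} copies of each")
```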

The beads were washed with about 60% nitric acid for 5 hours. The acid-washed beads were aminated
with aminopropyltriethoxysilane (2% in ethanol for 1 hour at 95 °C). Coupling of amino acids was performed
as follows:

FMOC protected amino acid    25 mM in DMF
PyBOP                        25 mM in DMF
HOBt                         25 mM in DMF
NMM                          42 mM



Coupling was performed for 1 hour with continuous mixing.

The library was built from the following 74 α-amine FMOC protected amino acids:
1-3, 5, 7-12, 14-23, 28, 29, 31-39, 42-48, 51-66, 68, 70-82, 85-89, 120.
The first incorporated amino acid (at the C-terminal) was β-Ala.
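
The ID list above is written as a mix of single numbers and ranges, so it is easy to miscount. A small sketch such as the following expands the list and confirms that it contains 74 distinct IDs; the function name and the string constant are illustrative and not part of the original description.

```python
def expand_ids(spec):
    """Expand a list such as '1-3, 5, 7-12' into individual integer IDs."""
    ids = []
    for part in spec.split(","):
        part = part.strip()
        if "-" in part:
            low, high = (int(x) for x in part.split("-"))
            ids.extend(range(low, high + 1))  # ranges are inclusive
        else:
            ids.append(int(part))
    return ids


# Amino acid IDs of the glass bead library, as listed in the text.
LIBRARY_IDS = "1-3, 5, 7-12, 14-23, 28, 29, 31-39, 42-48, 51-66, 68, 70-82, 85-89, 120"

ids = expand_ids(LIBRARY_IDS)
print(f"{len(ids)} IDs, {len(set(ids))} distinct")
```

Dropping ID 120 from the list leaves the 73 amino acids used for libraries 4 and 6.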

For the second amino acid the amino acids were grouped as shown in Table C-1:

Table C-1

Steps 2, 3, 4, 5 only: Total 18 groups

Group 1: 10AlaC-D, 12AlaC-L, 7Ala-D, 8Ala-L, 5Aib-L
Group 2: 16Arg-D(Mtr), 17Arg-L(Mtr), 44Leu-D, 45Leu-L, 9AlaB
Group 3: 11AcaE, 1Aba-L, 2Abu-L, 3AbuG-L
Group 4: 31Glu-D(OBut), 32Glu-L(OBut), 20Asp-D(OBut), 21Asp-L(OBut)
Group 5: 18Asn-D, 19Asn-L, 34GlyC-L, 35GlyC-D
Group 6: 29Gln-L, 28Gln-D, 22Ava5, 65Phe-4Cl-L
Group 7: 58NleM-L, 87ValM-D, 88ValM-L, 36GlyM
Group 8: 70PhepNt-L, 120PhepNt-D, 37GlyP-L, 89Hyp-L(Tbu)
Group 9: 42Ile-L, 85Val-D, 86Val-L, 33Gly
Group 10: 46LeuM-D, 47LeuM-L, 14AlaM-D, 15AlaM-L
Group 11: 48Lys-D(Boc), 51Lys-L(Boc), 52Met-D, 53Met-L
Group 12: 66PheM-D, 68PheM-L, 43IleM-L, 23Cit-L(Bos)
Group 13: 56Nle-D, 57Nle-L, 59Nva-D, 60Nva-L
Group 14: 61Orn-D(Boc), 62Orn-L(Boc), 38His-D(Trt), 39His-L(Trt)
Group 15: 63Phe-D, 64Phe-L, 54MetS-L, 55MetSO-L
Group 16: 79Tyr-D(But), 80Tyr-L(But), 77Trp-D, 78Trp-L
Group 17: 73Ser-D(But), 74Ser-L(But), 75Thr-D(But), 76Thr-L(But)
Group 18: 71Pro-D, 72Pro-L, 81Tyr2,5II-L, 82Tyr3,5Br-L

An individual bead in the library would display a mixture of peptides which, at the second amino acid
position, would feature only amino acids from a single group. Different beads could display amino acids
from different groups. The same groups were used for synthesis steps 2, 3, 4 and 5.

For step 6 (position 2 from the N-terminal) the amino acids were initially grouped as in Table C-2:

Table C-2

Group 1: 10AlaC-D, 12AlaC-L
Group 2: 7Ala-D, 8Ala-L
Group 3: 16Arg-D(Mtr), 17Arg-L(Mtr)
Group 4: 44Leu-D, 45Leu-L
Group 5: 11AcaE, 1Aba-L
Group 6: 2Abu-L, 3AbuG-L
Group 7: 31Glu-D(OBut), 32Glu-L(OBut)
Group 8: 20Asp-D(OBut), 21Asp-L(OBut)
Group 9: 18Asn-D, 19Asn-L
Group 10: 34GlyC-L, 35GlyC-D
Group 11: 29Gln-L, 28Gln-D
Group 12: 22Ava5, 65Phe-4Cl-L
Group 13: 58NleM-L, 87ValM-D
Group 14: 88ValM-L, 36GlyM
Group 15: 70PhepNt-L, 120PhepNt-D
Group 16: 37GlyP-L, 89Hyp-L(Tbu)
Group 17: 42Ile-L, 85Val-D
Group 18: 86Val-L, 33Gly
Group 19: 46LeuM-D, 47LeuM-L
Group 20: 14AlaM-D, 15AlaM-L
Group 21: 48Lys-D(Boc), 51Lys-L(Boc)
Group 22: 52Met-D, 53Met-L
Group 23: 66PheM-D, 68PheM-L
Group 24: 43IleM-L, 23Cit-L(Bos)
Group 25: 56Nle-D, 57Nle-L
Group 26: 59Nva-D, 60Nva-L
Group 27: 61Orn-D(Boc), 62Orn-L(Boc)
Group 28: 38His-D(Trt), 39His-L(Trt)
Group 29: 63Phe-D, 64Phe-L
Group 30: 54MetS-L, 55MetSO-L
Group 31: 79Tyr-D(But), 80Tyr-L(But)
Group 32: 77Trp-D, 78Trp-L
Group 33: 73Ser-D(But), 74Ser-L(But)
Group 34: 75Thr-D(But), 76Thr-L(But)
Group 35: 71Pro-D, 72Pro-L
Group 36: 81Tyr2,5II-L, 82Tyr3,5Br-L
Group 37: 5Aib-L, 9AlaB

For reasons of convenience, the initial 37 step 6 groups were merged into 12 larger groups, labeled A-L 
in Table C-3. Thus step 6 groups 1-3 went into tube A, groups 4-6 into tube B, etc. 
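
The merging rule (groups 1-3 into tube A, groups 4-6 into tube B, and so on) can be written out as a short sketch. Twelve tubes of three groups account for only 36 of the 37 groups; placing the remaining group 37 in the last tube (L) is an assumption made here for illustration, and the function name and parameters are hypothetical.

```python
import string


def assign_tubes(n_groups=37, per_tube=3, n_tubes=12):
    """Map tube labels (A, B, ...) to the step 6 group numbers merged into them."""
    labels = string.ascii_uppercase[:n_tubes]
    tubes = {}
    for i, label in enumerate(labels):
        first = i * per_tube + 1
        tubes[label] = list(range(first, first + per_tube))
    # Assumption: any groups left over (here only group 37) join the last tube.
    leftover = range(n_tubes * per_tube + 1, n_groups + 1)
    tubes[labels[-1]].extend(leftover)
    return tubes


tubes = assign_tubes()
for label, groups in tubes.items():
    print(label, groups)
```

Since each step 6 group holds two amino acids, a tube of three groups offers six candidate amino acids at the second position.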



Table C-3

Tube  Step 6 groups    Amino acids
A     1, 2, 3          10AlaC-D, 12AlaC-L, 7Ala-D, 8Ala-L, 16Arg-D(Mtr), 17Arg-L(Mtr)
B     4, 5, 6          44Leu-D, 45Leu-L, 11AcaE, 1Aba-L, 2Abu-L, 3AbuG-L
C     7, 8, 9          31Glu-D(OBut), 32Glu-L(OBut), 20Asp-D(OBut), 21Asp-L(OBut), 18Asn-D, 19Asn-L
D     10, 11, 12       34GlyC-L, 35GlyC-D, 29Gln-L, 28Gln-D, 22Ava5, 65Phe-4Cl-L
E     13, 14, 15       58NleM-L, 87ValM-D, 88ValM-L, 36GlyM, 70PhepNt-L, 120PhepNt-D
F     16, 17, 18       37GlyP-L, 89Hyp-L(Tbu), 42Ile-L, 85Val-D, 86Val-L, 33Gly
G     19, 20, 21       46LeuM-D, 47LeuM-L, 14AlaM-D, 15AlaM-L, 48Lys-D(Boc), 51Lys-L(Boc)
H     22, 23, 24       52Met-D, 53Met-L, 66PheM-D, 68PheM-L, 43IleM-L, 23Cit-L(Bos)
I     25, 26, 27       56Nle-D, 57Nle-L, 59Nva-D, 60Nva-L, 61Orn-D(Boc), 62Orn-L(Boc)
J     28, 29, 30       38His-D(Trt), 39His-L(Trt), 63Phe-D, 64Phe-L, 54MetS-L, 55MetSO-L
K     31, 32, 33       79Tyr-D(But), 80Tyr-L(But), 77Trp-D, 78Trp-L, 73Ser-D(But), 74Ser-L(But)
L     34, 35, 36, 37   75Thr-D(But), 76Thr-L(But), 71Pro-D, 72Pro-L, 81Tyr2,5II-L, 82Tyr3,5Br-L, 5Aib-L, 9AlaB

In Table C-4, the amino acids listed in the column headings are those inserted at the second
position from the N-terminal. Either of two different amino acids is found at this position in the peptides on
a given bead. Since tube A comprises beads from three different step 6 groups, there are six possibilities
for this position for the beads of a given tube A-L.

The contents of tube A were then distributed into 37 micronic tubes to be placed in column "A" of a
microtiter plate. The contents of each of these 37 tubes were reacted with the corresponding two amino acid
mixture set forth on the left side of Table C-4, thus supplying the amino terminal amino acid. The contents
of tube B went into the 37 tubes of the next column, which were similarly reacted, and so on for a total of
12x37 = 444 tubes, as shown in Table C-4.
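
To illustrate how a stained well narrows down the two N-terminal positions, the sketch below enumerates the 444 wells and decodes one of them into candidate amino acid IDs. The ID pairs are read from Tables C-2 and C-4 as far as they are legible; the placement of group 37 in tube L and all function and variable names are assumptions made for this example.

```python
import string

# Amino acid ID pairs, keyed by row number of Table C-4 (identical to the
# step 6 group numbers of Table C-2).
PAIRS = {
    1: (10, 12), 2: (7, 8), 3: (16, 17), 4: (44, 45), 5: (11, 1), 6: (2, 3),
    7: (31, 32), 8: (20, 21), 9: (18, 19), 10: (34, 35), 11: (29, 28),
    12: (22, 65), 13: (58, 87), 14: (88, 36), 15: (70, 120), 16: (37, 89),
    17: (42, 85), 18: (86, 33), 19: (46, 47), 20: (14, 15), 21: (48, 51),
    22: (52, 53), 23: (66, 68), 24: (43, 23), 25: (56, 57), 26: (59, 60),
    27: (61, 62), 28: (38, 39), 29: (63, 64), 30: (54, 55), 31: (79, 80),
    32: (77, 78), 33: (73, 74), 34: (75, 76), 35: (71, 72), 36: (81, 82),
    37: (5, 9),
}
TUBES = string.ascii_uppercase[:12]  # columns A to L of the plate


def groups_in_tube(tube):
    """Step 6 groups merged into a tube: three per tube, in order."""
    i = TUBES.index(tube)
    groups = [3 * i + 1, 3 * i + 2, 3 * i + 3]
    if tube == "L":
        groups.append(37)  # assumption: the 37th group went into the last tube
    return groups


def decode_well(tube, row):
    """Return (N-terminal candidates, second-position candidates) for one well."""
    n_terminal = PAIRS[row]
    second = [aa for group in groups_in_tube(tube) for aa in PAIRS[group]]
    return n_terminal, second


wells = [(tube, row) for tube in TUBES for row in PAIRS]
print(len(wells), "wells")
print(decode_well("A", 2))
```

A positive well therefore fixes the N-terminal residue to one of two amino acids and the second residue to one of six (eight in tube L under the assumption above), while positions further from the N-terminal remain to be resolved by sequencing.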

The library was screened for binding of IL6 that was labeled with the fluorescent label Cy3. No staining was
observed in any of the wells.
The library was stained with TBP1 that was labeled with tetramethyl rhodamine. Staining was observed
in the wells marked with "1" in Table C-4 below. Positive staining means that the well contained at least 4
stained beads.

Table C-4

Possible amino acids at 2nd position from N-terminal:
Tube A: 10AlaC-D, 12AlaC-L, 7Ala-D, 8Ala-L, 16Arg-D(Mtr), 17Arg-L(Mtr)
Tube B: 44Leu-D, 45Leu-L, 11AcaE, 1Aba-L, 2Abu-L, 3AbuG-L
Tube C: 31Glu-D(OBut), 32Glu-L(OBut), 20Asp-D(OBut), 21Asp-L(OBut), 18Asn-D, 19Asn-L
Tube D: 34GlyC-L, 35GlyC-D, 29Gln-L, 28Gln-D, 22Ava5, 65Phe-4Cl-L
Tube E: 58NleM-L, 87ValM-D, 88ValM-L, 36GlyM, 70PhepNt-L, 120PhepNt-D
Tube F: 37GlyP-L, 89Hyp-L(Tbu), 42Ile-L, 85Val-D, 86Val-L, 33Gly

(? = entry illegible in the source)

No.  Possible amino acids at N-terminal   A  B  C  D  E  F
1    10AlaC-D / 12AlaC-L                  0  0  0  0  0  0
2    7Ala-D / 8Ala-L                      1  1  1  0  0  0
3    16Arg-D(Mtr) / 17Arg-L(Mtr)          1  1  1  1  1  0
4    44Leu-D / 45Leu-L                    0  1  0  1  1  1
5    11AcaE / 1Aba-L                      0  1  1  1  1  1
6    2Abu-L / 3AbuG-L                     0  0  0  0  0  0
7    31Glu-D(OBut) / 32Glu-L(OBut)        0  0  1  0  0  0
8    20Asp-D(OBut) / 21Asp-L(OBut)        1  1  0  1  0  1
9    18Asn-D / 19Asn-L                    1  0  0  0  0  0
10   34GlyC-L / 35GlyC-D                  0  0  1  1  0  0
11   29Gln-L / 28Gln-D                    1  0  1  ?  0  0
12   22Ava5 / 65Phe-4Cl-L                 0  0  0  0  ?  0
13   58NleM-L / 87ValM-D                  ?  ?  0  0  0  0
14   88ValM-L / 36GlyM                    ?  ?  ?  0  0  0
15   70PhepNt-L / 120PhepNt-D             0  0  ?  ?  0  0
16   37GlyP-L / 89Hyp-L(Tbu)              0  0  0  0  0  0
17   42Ile-L / 85Val-D                    0  0  1  0  0  1
18   86Val-L / 33Gly                      0  0  0  0  0  0
19   46LeuM-D / 47LeuM-L                  0  0  0  0  0  0
20   14AlaM-D / 15AlaM-L                  0  1  ?  1  ?  ?
21   48Lys-D(Boc) / 51Lys-L(Boc)          0  ?  ?  ?  0  1
22   52Met-D / 53Met-L                    0  0  0  0  0  0
23   66PheM-D / 68PheM-L                  0  0  0  ?  ?  ?
24   43IleM-L / 23Cit-L(Bos)              0  1  0  0  0  0
25   56Nle-D / 57Nle-L                    ?  ?  ?  ?  0  ?
26   59Nva-D / 60Nva-L                    ?  ?  ?  ?  ?  ?
27   61Orn-D(Boc) / 62Orn-L(Boc)          0  0  0  0  0  0
28   38His-D(Trt) / 39His-L(Trt)          ?  ?  ?  0  0  ?
29   63Phe-D / 64Phe-L                    ?  ?  1  ?  0  0
30   54MetS-L / 55MetSO-L                 0  0  1  0  0  0
31   79Tyr-D(But) / 80Tyr-L(But)          0  0  0  0  0  0
32   77Trp-D / 78Trp-L                    0  0  0  0  0  0
33   73Ser-D(But) / 74Ser-L(But)          0  0  0  0  1  0
34   75Thr-D(But) / 76Thr-L(But)          0  0  0  0  0  0
35   71Pro-D / 72Pro-L                    0  0  0  0  0  0
36   81Tyr2,5II-L / 82Tyr3,5Br-L          0  0  0  0  0  0
37   5Aib-L / 9AlaB                       0  0  0  0  0  0

Table C-4 (cont'd)

Possible amino acids at 2nd position from N-terminal (cont'd):
Tube G: 46LeuM-D, 47LeuM-L, 14AlaM-D, 15AlaM-L, 48Lys-D(Boc), 51Lys-L(Boc)
Tube H: 52Met-D, 53Met-L, 66PheM-D, 68PheM-L, 43IleM-L, 23Cit-L(Bos)
Tube I: 56Nle-D, 57Nle-L, 59Nva-D, 60Nva-L, 61Orn-D(Boc), 62Orn-L(Boc)
Tube J: 38His-D(Trt), 39His-L(Trt), 63Phe-D, 64Phe-L, 54MetS-L, 55MetSO-L
Tube K: 79Tyr-D(But), 80Tyr-L(But), 77Trp-D, 78Trp-L, 73Ser-D(But), 74Ser-L(But)
Tube L: 75Thr-D(But), 76Thr-L(But), 71Pro-D, 72Pro-L, 81Tyr2,5II-L, 82Tyr3,5Br-L, 5Aib-L, 9AlaB

(? = entry illegible in the source)

No.  Possible amino acids at N-terminal   G  H  I  J  K  L
1    10AlaC-D / 12AlaC-L                  1  0  0  0  0  0
2    7Ala-D / 8Ala-L                      1  0  1  1  0  0
3    16Arg-D(Mtr) / 17Arg-L(Mtr)          0  0  0  0  0  0
4    44Leu-D / 45Leu-L                    0  1  1  1  1  0
5    11AcaE / 1Aba-L                      0  0  1  0  1  1
6    2Abu-L / 3AbuG-L                     0  1  0  0  0  1
7    31Glu-D(OBut) / 32Glu-L(OBut)        0  0  0  0  0  1
8    20Asp-D(OBut) / 21Asp-L(OBut)        0  0  0  0  0  1
9    18Asn-D / 19Asn-L                    1  1  0  0  0  0
10   34GlyC-L / 35GlyC-D                  0  ?  0  0  0  0
11   29Gln-L / 28Gln-D                    0  0  0  0  0  0
12   22Ava5 / 65Phe-4Cl-L                 0  0  0  0  0  0
13   58NleM-L / 87ValM-D                  0  0  0  0  0  0
14   88ValM-L / 36GlyM                    0  0  0  0  0  0
15   70PhepNt-L / 120PhepNt-D             0  0  0  0  0  0
16   37GlyP-L / 89Hyp-L(Tbu)              0  0  0  0  0  0
17   42Ile-L / 85Val-D                    0  0  0  0  0  0
18   86Val-L / 33Gly                      0  0  0  0  0  0
19   46LeuM-D / 47LeuM-L                  0  0  0  0  0  0
20   14AlaM-D / 15AlaM-L                  0  0  ?  ?  0  0
21   48Lys-D(Boc) / 51Lys-L(Boc)          0  0  0  0  0  0
22   52Met-D / 53Met-L                    0  0  0  0  0  1
23   66PheM-D / 68PheM-L                  0  0  0  0  0  0
24   43IleM-L / 23Cit-L(Bos)              0  0  1  0  0  0
25   56Nle-D / 57Nle-L                    0  0  0  0  0  0
26   59Nva-D / 60Nva-L                    0  0  0  0  0  0
27   61Orn-D(Boc) / 62Orn-L(Boc)          0  0  0  0  0  0
28   38His-D(Trt) / 39His-L(Trt)          0  0  0  0  0  0
29   63Phe-D / 64Phe-L                    0  0  1  0  1  0
30   54MetS-L / 55MetSO-L                 ?  ?  ?  ?  ?  0
31   79Tyr-D(But) / 80Tyr-L(But)          0  1  0  0  0  0
32   77Trp-D / 78Trp-L                    0  0  0  0  0  0
33   73Ser-D(But) / 74Ser-L(But)          0  0  0  0  0  0
34   75Thr-D(But) / 76Thr-L(But)          0  0  0  0  0  0
35   71Pro-D / 72Pro-L                    0  0  1  0  0  0
36   81Tyr2,5II-L / 82Tyr3,5Br-L          0  0  0  0  0  0
37   5Aib-L / 9AlaB                       0  0  0  0  0  0



Staining procedure

The beads were adsorbed in 6 well plates. PBS containing 0.1% Tween 20 and
0.1% sodium azide (wash buffer) was added to the wells. The wash buffer was removed by suction. To the
wells was added 200 µl of blocking buffer (PBS + 1 mg/ml of I-Block + Ca++ 0.9 mM + Mg++ 0.5 mM)
containing 5 µg/ml of TBP1 (over 99% pure) labeled with tetramethyl rhodamine. The solution was incubated
in the well for 30 minutes with gentle agitation. The wells were filled up (about 5 ml) with wash buffer.
The wash buffer was removed by suction.

Analysis procedure

The wells were observed with an inverted fluorescence microscope under 100x magnification. Some
fluorescent debris was observed. Thus, suspected positive beads were verified by observation at 400x
magnification.

Reference Example: Peptide and Peptoid Synthesis

Supports:

The most common support material for solid phase peptide synthesis consists of beads made of
polystyrene cross-linked with 1-4% of divinylbenzene (DVB). Other supports which have been used include,
among others: modified paper (cellulose), either as sheets or as Perloza beaded cellulose; polyamide based
resins, some of which are based on polyacrylamide; grafted polyethylene; polyethylene glycol modified
polystyrene/1% DVB, which is commercially available as Tentagel from several companies; and modified glass,
either as sheets (as used by Affimax) or as beads. Virtually any material stable to the solvents and
chemicals used in peptide synthesis might be used as a support. The preferred support is Tentagel.
Some references for novel supports are cited below:
1. Mendre, C., Sarrade, V. and Calas, B. Continuous flow synthesis of peptides using a polyacrylamide
gel resin (Expansin™). International Journal of Peptide and Protein Research 39:278-284, 1992.
2. Kanda, P., Kennedy, R.C. and Sparrow, J.T. Synthesis of polyamide supports for use in peptide
synthesis and as peptide-resin conjugates for antibody production. Int. J. Pept. Protein Res. 38:385-391,
1991.
3. Kiederowski, G. Light-directed parallel synthesis of up to 250,000 different oligopeptides and
oligonucleotides. Angew. Chem. Int. Ed. Engl. 30:822, 1991. Note: Synthesis on modified glass. This is
the technology used by Affimax.
4. Valerio, R.M., Benstead, M., Bray, A.M., Campbell, R.A. and Maeji, N.J. Synthesis of peptide
analogues using the multipin synthesis method. Anal. Biochem. 197:168-177, 1991. Note: Synthesis on
grafted polyethylene rods.
5. Calas, B., Mery, J., Parello, J. and Cave, A. Solid-phase synthesis using a new polyacrylic resin.
Tetrahedron 41:5331-5339, 1985.
6. Englebretsen, D.R. and Harding, D.R.K. Solid phase peptide synthesis on hydrophilic supports. Part
II: Studies using Perloza beaded cellulose. Int. J. Pept. Protein Res. 40:487-496, 1992.

Protecting groups:

The two most common protecting groups for the α-amino group are tert-butyloxycarbonyl (Boc) and
9-fluorenylmethoxycarbonyl (Fmoc). The Boc protecting group is removed under acidic conditions, e.g. by 100%
TFA. The Fmoc protecting group is removed under basic conditions, e.g. by 20% piperidine in DMF. We
prefer to use the Fmoc strategy.
Side chains of some of the amino acids also need to be protected to avoid damage during synthesis, or
to avoid formation of branched peptides by incorporation of amino acids at the free amino group appearing on
the side chain of a few amino acids (e.g. lysine, ornithine). The common protecting groups are as follows:
For free amino groups: Boc; Fmoc; benzyloxycarbonyl (CBZ).
For free carboxy groups: tert-butyl ester; benzyl ester; cyclohexyl ester.
For arginine: nitro (NO2); mesitylenesulfonic acid; tosyl; methoxy-2,3,6-trimethylsulfone (Mtr).
For asparagine: xanthyl.
For cysteine: acetamidomethyl; benzyl thioether; tert-butyl thioether; methylbenzyl; methoxybenzyl;
3-nitro-2-pyridinesulfonyl (Npys).
For glutamine: trityl; xanthyl (Xan).
For histidine: tosyl; trityl.
For free hydroxyls: benzyl ether; tert-butyl ether.
For tryptophan: N-formyl; N-bromobenzoxycarbonyl; nitrophenylsulfonyl (Nps).
For tyrosine: O-benzyl; O-2,6-dichlorobenzyl.

We prefer the following protecting groups:
For free amino groups: Boc.
For free carboxyl groups: tert-butyl ester.
For cysteine: not used.
For glutamine and asparagine: none.
For histidine: trityl.
For free hydroxyl groups: tert-butyl ether.
For tryptophan: none.
For arginine: methoxy-2,3,6-trimethylsulfone (Mtr).

Coupling reagents:

The coupling of amino acids in solid phase peptide synthesis has been under study since the
introduction of the technique by Merrifield in 1963. Today, the synthesis of short peptides composed of the
20 common L-amino acids can be performed by any of a large selection of methods. Recent reports in the
field describe novel methods intended to:
1. Shorten synthesis time.
2. Reduce racemization during synthesis.
3. Enable synthesis of long peptides.
4. Enable synthesis of "difficult" sequences.
5. Enable incorporation of unnatural amino acids such as N-substituted amino acids, or α-carbon
di-substituted amino acids where the free H is replaced with another group.
6. Improve the automation of peptide synthesis.
7. Improve large scale peptide synthesis.
8. Produce peptides with different types of pseudopeptide bonds such as:
Carba ψ(CH2);
Depsi ψ(CO-O);
Hydroxyethylene ψ(CHOH-CH2);
Ketomethylene ψ(CO-CH2);
Methylene-oxy CH2-O-;
Reduced CH2-NH;
Retro-inverso NH-CO;
Thiomethylene CH2-S-;
Thiopeptide CS-NH.
9. Improve production of peptides having constrained conformations such as cyclic peptides or multiple
antigen peptides (MAPs).
Several recent examples of studies of peptide synthesis are cited below:
1. Schnolzer, M., Alewood, P., Jones, A., Alewood, D. and Kent, S.B.H. In situ neutralization in Boc
chemistry solid phase peptide synthesis. Rapid, high yield assembly of difficult sequences. Int. J. Pept.
Protein Res. 40:180-193, 1992.
2. Chen, S. and Xu, J. A new coupling reagent for peptide synthesis. Benzotriazolyloxy-bis(pyrrolidino)-
carbonium hexafluorophosphate (BBC). Tetrahedron Letters 33:647-650, 1992.
3. Spencer, J.R., Antonenko, V.V., Delaet, N.G.J. and Goodman, M. Comparative study of methods to
couple hindered peptides. Int. J. Pept. Protein Res. 40:282-293, 1992.
4. Kiso, Y., Fujiwara, Y., Kimura, T., Nishitani, A. and Akaji, K. Efficient solid phase peptide synthesis.
Use of methanesulfonic acid α-amino deprotecting procedure and new coupling reagent, 2-(benzotriazol-
1-yl)oxy-1,3-dimethylimidazolidinium hexafluorophosphate (BOI). Int. J. Pept. Protein Res. 40:308-314,
1992.



Solvent: We prefer to use dimethylformamide (DMF) throughout the synthesis. We sometimes add
some dichloromethane (DCM) in order to enable easier collection of the beads: the density of DCM is
very high and beads float in mixtures of DMF and DCM.

It should be understood that the present invention is not limited to the use of a particular support, 
protective group, or solvent. 



Preferred Synthetic Cycle 

One cycle of the synthesis consists of the following operations: 



Table 100: Peptide Synthesis Cycle

Step | Operation | Materials | Volume/ml total resin (µl/well) | Time (min.) | Notes
1 | Fmoc deprotection | 20% piperidine in DMF | 300 | 10 | in wells
2 | sup. removal | | | |
3 | Fmoc deprotection | 20% piperidine in DMF | 300 | 5 |
4 | sup. removal | | | |
5 | wash | DMF | 300 | 5 |
6 | sup. removal | | | |
7 | wash | DMF | 300 | 5 |
8 | sup. removal | | | |
9 | wash | DMF | 300 | 5 |
10 | sup. removal | | | |
11 | resin collection | | 200 µl/well, ×3 times | | mixing thoroughly; in bulk
12 | dividing resin | | equal volume/well | | with mixing; into wells
13 | sup. removal | | | |
14 | wash | DMF | 300 | 1 |
15 | sup. removal | | | |
16 | coupling | 3 equiv. Fmoc AA; 3 equiv. BOP; 5 equiv. NMM; 3 equiv. Hobt | | 30 | vortex
17 | sup. removal | | | |
18 | wash | DMF | 300 | 5 |
19 | sup. removal | | | |
20 | wash | DMF | 300 | 5 |
21 | sup. removal | | | |
22 | wash | DMF | 300 | 5 |
23 | sup. removal | | | |
24 | re-coupling | 3 equiv. Fmoc AA; 3 equiv. BOP; 5 equiv. NMM; 3 equiv. Hobt | | 30 | vortex
25 | sup. removal | | | |
26 | wash | DMF | 300 | 5 |
27 | sup. removal | | | |
28 | wash | DMF | 300 | 5 |
29 | sup. removal | | | |
30 | wash | DMF | 300 | 5 |
31 | sup. removal | | | |

Abbreviations:

DMF - N,N-Dimethylformamide
Hobt - Hydroxybenzotriazole monohydrate
NMM - 4-Methylmorpholine
BOP - Benzotriazol-1-yl-oxy-tris(dimethylamino)phosphonium hexafluorophosphate
Fmoc - Fluorenylmethyloxycarbonyl
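
By way of illustration only, the timed steps of Table 100 may be tabulated and summed as in the following Python sketch. The step times and reagent equivalents are transcribed from the table; the function names and the example figure of 2 µmol of reactive sites per well are hypothetical and form no part of the described method.

```python
# Timed steps of one cycle, transcribed from Table 100: (step, operation, minutes).
# Supernatant removal, resin collection and resin division list no time and are omitted.
TIMED_STEPS = [
    (1, "Fmoc deprotection", 10),
    (3, "Fmoc deprotection", 5),
    (5, "wash", 5), (7, "wash", 5), (9, "wash", 5),
    (14, "wash", 1),
    (16, "coupling", 30),
    (18, "wash", 5), (20, "wash", 5), (22, "wash", 5),
    (24, "re-coupling", 30),
    (26, "wash", 5), (28, "wash", 5), (30, "wash", 5),
]

# Equivalents used in each coupling step (Table 100, steps 16 and 24).
COUPLING_EQUIV = {"Fmoc-AA": 3, "BOP": 3, "NMM": 5, "HOBt": 3}


def timed_minutes(steps=TIMED_STEPS):
    """Sum the minutes of all steps for which the table states a time."""
    return sum(minutes for _, _, minutes in steps)


def coupling_amounts_umol(resin_sites_umol, equiv=COUPLING_EQUIV):
    """Micromoles of each reagent for one coupling, given the micromoles of
    reactive sites in a well (an assumed input, not given in the text)."""
    return {name: n * resin_sites_umol for name, n in equiv.items()}


if __name__ == "__main__":
    print("timed minutes per cycle:", timed_minutes())
    print("per-well reagent amounts (umol):", coupling_amounts_umol(2.0))
```

Under these assumptions the timed operations sum to 121 minutes per cycle, not counting the operations for which the table gives no time.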

Global deprotection of side chain protecting groups is performed at the end of the synthesis by two
40-minute incubations in 25% TFA in DCM with 5% anisole and 5% thioanisole.
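
The composition of the deprotection mixture can be worked out for any batch volume, as in the Python sketch below. It assumes that the stated percentages are volume fractions of the final mixture and that DCM makes up the remainder; the text does not state this explicitly, and the function name and the 10 ml example volume are hypothetical.

```python
def deprotection_mix_ml(total_ml, tfa=0.25, anisole=0.05, thioanisole=0.05):
    """Split a total volume into component volumes (ml).

    Assumption: fractions are v/v of the final mixture and DCM is the balance.
    """
    dcm = 1.0 - (tfa + anisole + thioanisole)
    if dcm < 0:
        raise ValueError("component fractions exceed 100%")
    return {
        "TFA": total_ml * tfa,
        "anisole": total_ml * anisole,
        "thioanisole": total_ml * thioanisole,
        "DCM": total_ml * dcm,
    }


# Two incubations of 40 minutes each, as stated in the text.
TOTAL_INCUBATION_MIN = 2 * 40

if __name__ == "__main__":
    for name, ml in deprotection_mix_ml(10.0).items():
        print(f"{name}: {ml:.2f} ml")
    print("total incubation:", TOTAL_INCUBATION_MIN, "min")
```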

Peptides including Unusual Amino Acids

The current trend in peptide synthesis, especially in the irrational drug design approach, calls for the
incorporation of many different types of building blocks. Many of the new building blocks used are amino
acids which are not genetically encoded, e.g., glycosylated amino acids, or various unnatural amino acids.
The references cited below indicate some of these recent efforts:

1. Bielfeldt, T., Peters, S., Meldal, M., Bock, K. and Paulsen, H. A new strategy for solid-phase synthesis
of O-glycopeptides. Angew. Chem. (Engl.) 31:857-859, 1992.

2. Gurjar, M.K. and Saha, U.K. Synthesis of the glycopeptide-O-(3,4-di-O-methyl-2-O-[3,4-di-O-methyl-α-
L-rhamnopyranosyl]-α-L-rhamnopyranosyl)-L-alaninol: An unusual part structure in the glycopeptidolipid
of Mycobacterium fortuitum. Tetrahedron 48:4039-4044, 1992.

3. Kessler, H., Wittmann, V., Kock, M. and Kottenhahn, M. Synthesis of C-glycopeptides via free radical
addition of glycosyl bromides to dehydroalanine derivatives. Angew. Chem. (Engl.) 31:902-904, 1992.

4. Kraus, J.L. and Attardo, G. Synthesis and biological activities of new N-formylated methionyl peptides
containing an α-substituted glycine residue. European Journal of Medicinal Chemistry 27:19-26, 1992.

5. Mhaskar, S.Y. Synthesis of N-lauroyl dipeptides and correlation of their structure with surfactant and
antibacterial properties. J. Am. Oil Chem. Soc. 69:647-652, 1992.

6. Moree, W.J., Van der Marel, G.A. and Liskamp, R.M.J. Synthesis of peptides containing the β-
substituted aminoethane sulfinamide or sulfonamide transition-state isostere derived from amino acids.
Tetrahedron Lett. 33:69-6392, 1992.

7. Paquet, A. Further studies on the use of 2,2,2-trichloroethyl groups for phosphate protection in
phosphoserine peptide synthesis. International Journal of Peptide and Protein Research 39:82-86,
1992.

8. Sewald, N., Riede, J., Bissinger, P. and Burger, K. A new convenient synthesis of 2-trifluoromethyl
substituted aspartic acid and its isopeptides. Part 11. Journal of the Chemical Society, Perkin
Transactions 1 1992:267-274, 1992.

9. Simon, R.J., Kania, R.S., Zuckermann, R.N., Huebner, V.D., Jewell, D.A., Banville, S., Ng, S., Wang, L.,
Rosenberg, S., Marlowe, C.K., Spellmeyer, D.C., Tan, R., Frankel, A.D., Santi, D.V., Cohen, F.E. and
Bartlett, P.A. Peptoids: A modular approach to drug discovery. Proc. Natl. Acad. Sci. USA 89:9367-9371,
1992.

10. Tung, C.-H., Zhu, T., Lackland, H. and Stein, S. An acridine amino acid derivative for use in Fmoc
peptide synthesis. Peptide Research 5:115-118, 1992.

11. Elofsson, M. Building blocks for glycopeptide synthesis: Glycosylation of 3-mercaptopropionic acid
and Fmoc amino acids with unprotected carboxyl groups. Tetrahedron Lett. 32:7613-7616, 1991.

12. McMurray, J.S. Solid phase synthesis of a cyclic peptide using Fmoc chemistry. Tetrahedron
Letters 32:7679-7682, 1991.

13. Nunami, K.-I., Yamazaki, T. and Goodman, M. Cyclic retro-inverso dipeptides with two aromatic side
chains. I. Synthesis. Biopolymers 31:1503-1512, 1991.

14. Rovero, P. Synthesis of cyclic peptides on solid support. Tetrahedron Letters 32:2639-2642, 1991.

15. Elofsson, M., Walse, B. and Kihlberg, J. Building blocks for glycopeptide synthesis: Glycosylation of
3-mercaptopropionic acid and Fmoc amino acids with unprotected carboxyl groups. Tetrahedron Letters
32:7613-7616, 1991.

16. Bielfeldt, T., Peters, S., Meldal, M., Bock, K. and Paulsen, H. A new strategy for solid-phase synthesis
of O-glycopeptides. Angew. Chem. (Engl.) 31:857-859, 1992.

17. Luning, B., Norberg, T. and Tejbrant, J. Synthesis of glycosylated amino acids for use in solid phase
glycopeptide synthesis, part 2: N-(9-fluorenylmethyloxycarbonyl)-3-O-[2,4,6-tri-O-acetyl-α-D-
xylopyranosyl)-β-D-glucopyranosyl]-L-serine. J. Carbohydr. Chem. 11:933-943, 1992.

18. Peters, S., Bielfeldt, T., Meldal, M., Bock, K. and Paulsen, H. Solid phase peptide synthesis of mucin
glycopeptides. Tetrahedron Lett. 33:6445-6448, 1992.

19. Urge, L., Otvos, L., Jr., Lang, E., Wroblewski, K., Laczko, I. and Hollosi, M. Fmoc-protected,
glycosylated asparagines potentially useful as reagents in the solid-phase synthesis of N-glycopeptides.
Carbohydr. Res. 235:83-93, 1992.

20. Gerz, M., Matter, H. and Kessler, H. S-glycosylated cyclic peptides. Angew. Chem. (Engl.) 32:269-
271, 1993.

Branched Peptides 

One of the advantages of the chemical approach to peptide libraries (as opposed to libraries expressed by
biological means, e.g., filamentous phages) is the ability to produce and test branching peptides. Early
examples in the literature for such structures are found in the use of multiple antigenic peptides (MAPs) as
immunogens. In MAPs, peptide haptens are attached to a branching "tree" of lysines.

1. Baleux, F. and Dubois, P. Novel version of Multiple Antigenic Peptide allowing incorporation on a
Cysteine functionalized lysine tree. Int. J. Pept. Protein Res. 40:7-12, 1992.

2. Munesinghe, D.Y., Clavijo, P., Calle, M.C., Nussenzweig, R.S. and Nardin, E. Immunogenicity of
multiple antigen peptides (MAP) containing T and B cell epitopes of the repeat region of the P.
falciparum circumsporozoite protein. Eur. J. Immunol. 21:3015-3020, 1991.

In the current MAP system identical peptide sequences are repeated several times in the MAP structure.
Such an arrangement apparently stabilizes the peptide conformation, allowing for better presentation of the
antigenic structure and hence better immunogenicity. The use of approaches similar to the MAP could
enable better biological activity of peptides due to stabilization of conformation.
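
As a rough illustration of how a lysine tree multiplies attachment points, the Python sketch below counts the free amino groups available after a given number of branching levels. It assumes that every lysine in the tree offers two amino groups (α and ε) for further growth, so that each level doubles the count; the text does not spell this out, and the function name is hypothetical.

```python
def map_attachment_points(lysine_levels):
    """Free amino groups on a lysine tree after `lysine_levels` branching levels.

    Assumption: each lysine contributes two amino groups (alpha and epsilon),
    so every level doubles the number of attachment points.
    """
    if lysine_levels < 0:
        raise ValueError("lysine_levels must be non-negative")
    return 2 ** lysine_levels


if __name__ == "__main__":
    for levels in range(4):
        print(levels, "level(s):", map_attachment_points(levels), "peptide copies")
```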

The interaction of short linear peptides with their targets occurs along the peptide length. Formation of
branching peptides may enable interaction of the peptide with the target throughout a surface and thus
mimic the type of interaction of some antibodies with their target antigens (as observed by X-ray
crystallography and analysis of antibody-antigen complexes). This type of interaction opens up new
possibilities for small peptide-ligand interactions which are non-existent for linear peptides but existent for
protein-ligand interactions.

Cyclic Peptides 

Many naturally occurring peptides are cyclic. Cyclization is a common mechanism for stabilization of
peptide conformation, thereby achieving improved association of the peptide with its ligand and hence
improved biological activity. Cyclization is usually achieved by intra-chain cystine formation, or by formation
of a peptide bond between side chains or between the N- and C-termini. Cyclization has usually been
performed on peptides in solution, but several publications have appeared recently that describe cyclization
of peptides on beads (see references below). These published techniques may be directly applicable to our
library approach.

1. Spatola, A.F., Anwer, M.K. and Rao, M.N. Phase transfer catalysis in solid phase peptide synthesis.
Preparation of cyclo[Xxx-Pro-Gly-Yyy-Pro-Gly] model peptides and their conformational analysis. Int. J.
Pept. Protein Res. 40:322-332, 1992.

2. Tromelin, A., Fulachier, M.-H., Mourier, G. and Menez, A. Solid phase synthesis of a cyclic peptide
derived from a curaremimetic toxin. Tetrahedron Lett. 33:5197-5200, 1992.

3. Trzeciak, A. Synthesis of 'head-to-tail' cyclized peptides on solid supports by Fmoc chemistry.
Tetrahedron Lett. 33:4557-4560, 1992.

4. Wood, S.J. and Wetzel, R. Novel cyclization chemistry especially suited for biologically derived,
unprotected peptides. Int. J. Pept. Protein Res. 39:533-539, 1992.

5. Gilon, C., Halle, D., Chorev, M., Selinger, Z. and Byk, G. Backbone cyclization: A new method for
conferring conformational constraint on peptides. Biopolymers 31:745-750, 1991.

6. McMurray, J.S. Solid phase synthesis of a cyclic peptide using Fmoc chemistry. Tetrahedron
Letters 32:7679-7682, 1991.

7. Rovero, P. Synthesis of cyclic peptides on solid support. Tetrahedron Letters 32:2639-2642, 1991.

8. Yajima, X. Cyclization on the bead via following Cys Acm deprotection. Tetrahedron 44:805, 1988.

Peptoid Synthesis

Most, if not all, materials containing at least one free amino group and one free carboxyl group might be
used for synthesis of polypeptide polymers. Most, if not all, materials having only a single type of group
(i.e., free amino or free carboxyl) can be incorporated at the C- or N-terminals respectively, or at
appropriate side chains. Most, if not all, materials having a group reactive with free carboxyl, free amino,
free hydroxyl or free thio groups could be used for modification of the terminals or the side chains of
appropriate amino acids. So far only a small variety of such compounds has actually been used for
synthesis. Some of the reported structures are described below:

1. Modification of the R group in single Cα-substituted amino acids. Modified R groups that have been
reported are: glycosylated, phosphorylated, sulfated, metal chelators, nucleotide residues, and many
others.

2. Modification of the peptide bonds into pseudopeptide bonds: The pseudopeptide bonds are usually
incorporated into di-pseudopeptides which are then incorporated into peptides. It is not possible to
sequence such pseudopeptides and thus sequence determination would have to rely on encoding.
Following is a list of most of the pseudopeptide bonds which were described in the literature.
Carba ψ(CH2-CH2)
Depsi ψ(CO-O)
Hydroxyethylene ψ(CHOH-CH2)
Ketomethylene ψ(CO-CH2)
Methylene-oxy CH2-O-
Reduced CH2-NH
Retro-inverso NH-CO
Thiomethylene CH2-S-
Thiopeptide CS-NH

3. Backbone modifications: Use of non-α amino acids, e.g., β-Alanine.

4. α-Amino acids with 2 R groups on the Cα. The R groups may be similar or different.

5. Dehydroamino acids of the general structure:

HOOC-C(NH2)=CH-R

6. N-modified amino acids of the general structure:

HOOC-CH(R1)-NH-R2
Table 101

ID | Short name | Name | D/L | Protecting group, α-amine | Protecting group, side chain
1 | Aba2-L | Anthranilic a. | | Fmoc |
2 | Abu-L | alpha-L-Aminobutyric | L | Fmoc |
3 | AbuG-L | gamma-L-Aminobutyric a. | L | Fmoc |
4 | Asp-L | L-Aspartic (ODmb) | L | Fmoc | ODmb
5 | Aib-L | alpha-Methyl-L-Ala | | Fmoc |
6 | Lys-L | L-Lysine (Dde) | L | Fmoc | Dde
7 | Ala-D | D-Alanine | D | Fmoc |
8 | Ala-L | L-Alanine | L | Fmoc |
9 | AlaB | Beta-Alanine | | Fmoc |
10 | AlaC-D | Cyclohexyl-D-Alanine | D | Fmoc |
11 | AcaE | Epsilon-amino-Caproic acid | | Fmoc |
12 | AlaC-L | Cyclohexyl-L-Alanine | L | Fmoc |
13 | Asp-L | L-Aspartic (O-2-Ada) | L | Fmoc | O-2-Ada
14 | AlaM-D | N-Methyl-D-Ala | D | Fmoc |
15 | AlaM-L | N-Methyl-L-Ala | L | Fmoc |
16 | Arg-D | D-Arginine (Mtr) | D | Fmoc | Mtr
17 | Arg-L | L-Arginine (Mtr) | L | Fmoc | Mtr
18 | Asn-D | D-Asparagine | D | Fmoc |
19 | Asn-L | L-Asparagine | L | Fmoc |
20 | Asp-D | D-Aspartic (OBut) | D | Fmoc | OBut
21 | Asp-L | L-Aspartic (OBut) | L | Fmoc | OBut
22 | Ava5 | 5-Aminovaleric a. | | Fmoc |
23 | Cit-L | L-Citrulline (Boc) | L | Fmoc | S-Bzl
24 | Cys-L | L-Cys (S-Bzl) | L | Fmoc | S-Bzl
25 | Cys-L | L-Cys (Acm) | L | Fmoc | Acm
26 | Cys-L | L-Cys (But) | L | Fmoc | But
27 | Cys-L | L-Cys (4-Me-Bzl) | L | Fmoc | 4-Me-Bzl
28 | Gln-D | D-Glutamine | D | Fmoc |
29 | Gln-L | L-Glutamine | L | Fmoc |
30 | Gln-L | L-Glutamine (Dod) | L | Fmoc | Dod
31 | Glu-D | D-Glutamic (OBut) | D | Fmoc | OBut
32 | Glu-L | L-Glutamic (OBut) | L | Fmoc | OBut
33 | Gly | Glycine | | Fmoc |
34 | GlyC-L | L-Cyclohexylglycine | L | Fmoc |
35 | GlyC-D | D-Cyclohexylglycine | D | Fmoc |
36 | GlyM | N-Methylglycine | | Fmoc |
37 | GlyP-L | L-Phenylglycine | L | Fmoc |
38 | His-D | D-Histidine (Trt) | D | Fmoc | Trt
39 | His-L | L-Histidine (Trt) | L | Fmoc | Trt
40 | Asp-L | L-Aspartic (O-2-Ada) | L | Boc !! | O-2-Ada
41 | Ile-D | D-Isoleucine | D | Fmoc |
42 | [illegible] | L-Isoleucine | L | [illegible] |
43 | Ile-M | N-Methyl-L-Isoleucine | L | Fmoc |
44 | Leu-D | D-Leucine | D | Fmoc |
45 | Leu-L | L-Leucine | L | Fmoc |
46 | LeuM-D | N-Methyl-L-Leucine | D | Fmoc |
47 | LeuM-L | N-Methyl-L-Leucine | L | Fmoc |
48 | Lys-D | D-Lysine (Boc) | D | Fmoc | Boc
49 | Lys-L | L-Lysine (-) | L | Fmoc | None
50 | Lys-L | L-Lysine (Fmoc) | L | Fmoc | Fmoc
51 | Lys-L | L-Lysine (Boc) | L | Fmoc | Boc
52 | Met-D | D-Methionine | D | Fmoc |
53 | Met-L | L-Methionine | L | Fmoc |
54 | MetS-L | L-Methionine sulfone | L | Fmoc |
55 | MetSO-L | L-Methionine sulfoxide | L | Fmoc |
56 | Nle-D | D-Norleucine | D | Fmoc |
57 | Nle-L | L-Norleucine | L | Fmoc |
58 | NleM-L | N-Methyl-L-Norleucine | L | Fmoc |
59 | Nva-D | D-Norvaline | D | Fmoc |
60 | Nva-L | L-Norvaline | L | Fmoc |
61 | Orn-D | D-Ornithine (Boc) | D | Fmoc | Boc
62 | Orn-L | L-Ornithine (Boc) | L | Fmoc | Boc
63 | Phe-D | D-Phenylalanine | D | Fmoc |
64 | Phe-L | L-Phenylalanine | L | Fmoc |
65 | Phe-4Cl-L | 4-Chloro-L-phenylalanine | L | Fmoc |
66 | PheM-D | N-Methyl-D-phenylalanine | D | Fmoc |
67 | Trp-L | L-Tryptophan (Boc) | L | Fmoc | Boc
68 | PheM-L | N-Methyl-L-phenylalanine | L | Fmoc |
69 | PhepF-DL | p-Fluoro-DL-phenylalanine | DL | Fmoc |
70 | PhepNt-L | p-Nitro-L-phenylalanine | L | Fmoc |
71 | Pro-D | D-Proline | D | Fmoc |
72 | Pro-L | L-Proline | L | Fmoc |
73 | Ser-D | D-Serine (But) | D | Fmoc | But
74 | Ser-L | L-Serine (But) | L | Fmoc | But
75 | Thr-D | D-Threonine (But) | D | Fmoc | But
76 | Thr-L | L-Threonine (But) | L | Fmoc | But
77 | Trp-D | D-Tryptophan | D | Fmoc |
78 | Trp-L | L-Tryptophan | L | Fmoc |
79 | Tyr-D | D-Tyrosine (But) | D | Fmoc | But
80 | Tyr-L | L-Tyrosine (But) | L | Fmoc | But
81 | Tyr3,5I-L | 3,5-Diiodo-L-Tyrosine | L | Fmoc |
82 | Tyr3,5Br-L | 3,5-Dibromo-L-Tyrosine | L | Fmoc |
83 | Tyrl-L | L-Tyrosine (2,6-dichloro-Bzl) | L | Fmoc |
84 | TyrM-L | Methyl-L-Tyrosine (Me) | L | Boc !! |
85 | Val-D | D-Valine | D | Fmoc |
86 | Val-L | L-Valine | L | Fmoc |
87 | ValM-D | N-Methyl-D-Valine | D | Fmoc |
88 | ValM-L | N-Methyl-L-Valine | L | Fmoc |
89 | Hyp-L | L-Hydroxyproline-(t-Butyl) | L | Fmoc |
90 | Asp-L | L-Aspartic (O-1-Ada) | L | Fmoc | O-1-Ada
91 | His-L | L-Histidine (Boc) | L | Fmoc | Boc
92 | His-L | L-Histidine (Bum) | L | Fmoc | Bum
93 | His-L | L-Histidine (Tos) | L | Fmoc | Tos
94 | Ser-L | L-Serine (Trt) | L | Fmoc | Trt
95 | Arg-L | L-Arginine (Tos) | L | Fmoc | Tos
96 | Asn-L | L-Asparagine (Trt) | L | Fmoc | Trt
97 | Asp-L | L-Aspartic (OBzl) | L | Fmoc | OBzl
98 | Glu-L | L-Glutamic (OBzl) | L | Fmoc | OBzl
99 | Gln-L | L-Glutamine (Trt) | L | Fmoc | Trt
100 | Hyp-L | L-Hydroxyproline | L | Fmoc |
101 | Ser-L | L-Serine (Bzl) | L | Fmoc | Bzl
102 | Thr-L | L-Threonine (Bzl) | L | Fmoc | Bzl
103 | Tyr-L | L-Tyrosine (Bzl) | L | Fmoc | Bzl
104 | Arg-L | L-Arginine (Pmc) | L | Fmoc | Pmc
105 | Arg-D | D-Arginine (Pmc) | D | Fmoc | Pmc
106 | Asn-L | L-Asparagine (Dod) | L | Fmoc | Dod
107 | Asn-L | L-Asparagine (Mtt) | L | Fmoc | Mtt
108 | Gln-L | L-Glutamine (Mtt) | L | Fmoc | Mtt
109 | Lys-L | L-Lysine (Z) | L | Fmoc | Z
110 | Asn-L | L-Asparagine (Tmob) | L | Fmoc | Tmob
111 | Gln-L | L-Glutamine (Tmob) | L | Fmoc | Tmob
112 | Ser-L | L-Serine (Ac) | L | Fmoc | Ac
113 | Ser-D | D-Serine (Bzl) | D | Fmoc | Bzl
114 | Thr-D | D-Threonine (Bzl) | D | Fmoc | Bzl
115 | Trp-D
Claims

1. A library of polymeric molecules, each consisting essentially of a plurality of monomeric units, said
library comprising a plurality of different sequences of said monomeric units, said molecules being
immobilized upon beads, each bead carrying a plurality of different sequences, the expected
amount of each such sequence on each bead being sufficient for detection of whether a molecule
having that sequence will bind to a target of interest, each sequence comprising a familial portion and
an individual portion, the familial portion having a substantially lesser degree of diversity among the
molecules carried by a single bead than among the molecules of the library as a whole, such familial
portion thereby being sequenceable upon retrieval of the molecules carried by a single bead.

2. A method of constructing a library of polymeric molecules which may be synthesized by stepwise
conjugation of monomeric or oligomeric reactants which comprises:

(a) providing a plurality of beads;

(b) in a plurality of synthetic cycles, stepwise conjugating a monomeric or oligomeric reactant to said
beads or to a nascent polymeric molecule thereon, where for one or more "structured random"
synthetic cycles, (i) dividing the beads into N aliquots, (ii) reacting each aliquot with one and only
one of a set of N different predetermined monomeric or oligomeric reactants, where the value of N
and the reactants of said set may be the same or different for each cycle, and (iii) pooling said
reacted aliquots, said synthetic cycles stepwise forming said molecules through the coupling of a
reactant of one cycle to the reactant of another cycle.

3. The method of claim 2, further comprising one or more "structured random" synthetic cycles in each of
which all beads are reacted with a single predetermined mixture of monomeric or oligomeric reactants,
where said mixture may be the same or different for each such cycle.

4. The method of claim 3, further comprising one or more "nonrandom" synthetic cycles in which all 
beads are reacted with a purified reactant so as to introduce a constant element into said molecule. 

5. A method of identifying polymeric molecules which bind to a target of interest which comprises:

(a) providing a first polymeric molecule library according to claim 1;

(b) contacting the first library with the target of interest, under conditions permitting the detection of
the binding of the target to polymeric molecules carried by a bead of the libraries and selecting
beads carrying polymeric molecules to which said target binds;

(c) determining at least the familial portion of the sequences of the polymeric molecules carried by a
selected bead of the first library to which the target binds,

(d) providing a second polymeric molecule library according to claim 1, said second library being
such that essentially all sequences expected to be carried on said selected bead of the first library
are represented in the molecules of the second library, said second library essentially omitting
sequences of the first library which were not expected to be carried on the selected bead;

(e) contacting the second library with the target of interest, under conditions permitting the detection
of the binding of the target to molecules carried by a bead of the libraries and selecting beads
carrying peptides to which said target binds;

(f) determining at least the familial portion of the sequences of molecules carried by a selected bead
of the second library to which the target binds,
whereby a sequence corresponding to a target binding molecule of the first library which was
carried by said first selected bead is further determined.

6. The library of claim 1 or the method of any of claims 2-5 wherein the library comprises at least 10^7
beads.

7. The library or method of any of claims 1-6 wherein each bead carries at least 10^3 molecules.

8. The library or method of any of claims 1-7 wherein the assay requires no more than 10^7, more
preferably no more than 10^6, binding molecules per bead for the detection of binding to target, and the
average sampling level per bead for the sequences on the bead is at least about equal to said peptide-
per-bead detection limit.

9. The library or method of any of claims 1-8 wherein the assay requires no more than ten, more
preferably two, still more preferably one bead carrying target binding molecules for detection, and the
ratio of the number of beads in the library to the library partitioning factor is at least about equal to said
bead-per-library detection limit.

10. The library or method of any of claims 1-9 wherein the size of the library is at least 10^8, more
preferably at least 10^10, still more preferably at least 10^12, and most preferably at least 10^14 molecules.

11. The library or method of any of claims 1-10 wherein the diversity of the library is at least 10^8, more
preferably at least 10^9, still more preferably at least 10^10, and most preferably at least 10^11 unique
sequences.

12. The library or method of any of claims 1-11 wherein during at least one random cycle at least forty
different units are coupled to the nascent molecules of the library.

13. The library or method of any of claims 1-12 wherein during each random cycle at least forty different
units are coupled to the nascent molecules of the library.

14. The library or method of any of claims 1 wherein the molecules are peptides, peptoids, nucleic acids or
carbohydrates, preferably peptides.

15. The library or method of claims 1-14 in which the polymers have a length of at least five units.

16. The library or method of any of claims 1-14 in which the familial portion is one to four units in length.

17. The library or method of any of claims 1-16 in which the familial portion is identical for all molecules on
a bead.

18. The library or method of any of claims 1-16 in which, at at least one monomer position in the familial
portion, a more difficult-to-sequence monomer unit in a first molecule on a bead is paired with a less
difficult-to-sequence monomer unit at the corresponding monomer position in a second molecule on the
same bead.

19. A method of constructing a library of polymeric molecules which may be synthesized by stepwise
conjugation of monomeric or oligomeric reactants which comprises

(a) providing a support having a surface which is dividable into a plurality of individually selectable
zones,

(b) for one or more rounds,

(i) reacting a first selected zone of the surface of said support with a first selected monomeric or
oligomeric reactant, so that said reactant is coupled with said support, or bound nascent
polymeric molecules, essentially only within said first selected zone

(ii) reacting a second selected zone of the surface of said support to a second selected
monomeric or oligomeric reactant, so that said reactant is coupled with said support, or bound
nascent polymeric molecules, essentially only within said second selected zone, said first and
second zones being nonoverlapping and said first and second reactants being different; and

(c) for one or more rounds, reacting the entire support surface with a mixture of two or more
selected monomeric or oligomeric reactants.

20. The method of claim 19 in which a zone is selected by exposing the surface to radiation through mask
means directing the radiation onto and only onto the selected zone, thereby activating the zone by
removal of photolabile protecting groups from the irradiated zone.
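
For illustration only, the divide, react and pool steps recited in claim 2 can be simulated with the short Python sketch below, together with the upper bound on diversity that such cycles allow. The sketch is a simplification under stated assumptions: each monomer is represented by a single character, each bead carries one sequence, and the mixture cycles of claim 3 are not modeled. The function names, the bead count and the example reactant sets are hypothetical.

```python
import random


def split_and_pool(n_beads, reactant_sets, seed=0):
    """Simulate cycles of (i) dividing beads into N aliquots, (ii) reacting each
    aliquot with exactly one of N reactants, and (iii) pooling the aliquots.

    Returns the sequence built on each bead (one string per bead).
    """
    rng = random.Random(seed)
    beads = [""] * n_beads
    for reactants in reactant_sets:
        rng.shuffle(beads)                       # mix the pooled beads
        n = len(reactants)
        aliquots = [beads[i::n] for i in range(n)]   # (i) divide into N aliquots
        beads = [seq + r                         # (ii) one reactant per aliquot
                 for aliquot, r in zip(aliquots, reactants)
                 for seq in aliquot]             # (iii) pool
    return beads


def max_diversity(reactant_sets):
    """Upper bound on distinct sequences: product of the set sizes per cycle."""
    total = 1
    for reactants in reactant_sets:
        total *= len(reactants)
    return total


if __name__ == "__main__":
    cycles = ["AB", "CD", "EF"]                  # hypothetical two-reactant cycles
    library = split_and_pool(1000, cycles)
    print(len(library), "beads,", len(set(library)), "distinct sequences")
    # Five cycles with forty units each give 40**5 possible sequences.
    print(max_diversity([range(40)] * 5))
```

With forty different units in each of five cycles, the bound is 40^5 = 102,400,000 sequences, which exceeds 10^8.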